Sequence :

Scoring table: 
September 17, 2004, 16:42:05 ; Search time 80.461 Seconds

(without alignments) 
Gapop 10.0 , Gapext 0.5
1586107 seqs, 282547505 residues
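
The Gapop and Gapext values above are the gap-open and gap-extension penalties of an affine gap model. As a rough illustration only, the Python sketch below computes the penalty charged for a single gap of a given length. The function name `affine_gap_penalty` and the `extend_first` switch are hypothetical; whether the search program charges the extension penalty for the first gapped residue is an assumption that this report does not state.

```python
def affine_gap_penalty(length, gap_open=10.0, gap_extend=0.5, extend_first=True):
    """Penalty for one gap of `length` residues under an affine gap model.

    gap_open and gap_extend default to the Gapop/Gapext values in this report.
    extend_first is an assumption: if True, the extension penalty is also
    charged for the first gapped residue; if False, only for later residues.
    """
    if length <= 0:
        return 0.0
    extended = length if extend_first else length - 1
    return gap_open + gap_extend * extended
```

Under either convention a long gap costs only slightly more than a short one, which fits the long gap runs reported in the 352-residue hits below.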

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : A__Geneseq_29 Jan04 : * 
Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
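
Alongside Pred. No., each result summary prints two percentages, Query Match and Best Local Similarity. The Python sketch below shows one way these figures can be reproduced from the other numbers in a summary. It assumes, without confirmation from the report itself, that Query Match is the hit score divided by the query self-score (797, the score of the two 100% hits) and that Best Local Similarity is the number of matches divided by the total number of alignment columns. The function names are hypothetical.

```python
def query_match_pct(score, query_self_score):
    """Score of a hit as a percentage of the query's score against itself."""
    return round(100.0 * score / query_self_score, 1)


def local_similarity_pct(matches, conservative, mismatches, indels):
    """Identical positions as a percentage of all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Figures taken from the 352-residue CCR5 hits in this report.
print(query_match_pct(379, 797))             # score 379 against self-score 797
print(local_similarity_pct(99, 7, 27, 150))  # 99 matches over 283 columns
```

With the counts reported for the CCR5 hits (Score 379; Matches 99; Conservative 7; Mismatches 27; Indels 150) these assumptions give 47.6% and 35.0%, which agrees with the printed values.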
ALIGNMENTS


RESULT 1 
ABG32539 

ID ABG32539 standard; protein; 138 AA. 
XX 

AC ABG32539; 
XX 

DT 15-NOV-2002 (first entry) 
XX 
DE Human CCR5-based scaffolded fusion protein #1.
XX
KW Scaffolded protein; CCR5; HIV; human immunodeficiency virus infection;
KW ECD; extracellular domain; metal chelating motif; zinc finger protein;
KW integral membrane protein; soluble loop; intracellular domain; ICD;
KW gene therapy; immunogen; viral infection; human.
XX

OS Homo sapiens. 


OS Synthetic. 
XX 

PN WO200260477-A1. 
XX 

PD 08-AUG-2002. 
XX 

PF 29-JAN-2002; 2002WO-US002377 . 
XX 

PR 31-JAN-2001; 2001US-0265782P . 
PR 31-JAN-2001; 2001US-0265858P . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Coleman TA, Mansfield B; 
XX 

DR WPI; 2002-643357/69. 
XX 
PT Novel scaffolded fusion polypeptide useful for therapeutic purposes or
PT for screening molecules that bind/activate/inhibit/modulate the
PT polypeptide, comprises a functional polypeptide domain fused to a
PT scaffold domain.
XX 

PS Example 1; Page 21; 64pp; English. 
XX 

CC The invention relates to a scaffolded fusion polypeptide comprising a
CC functional polypeptide domain fused to a scaffold domain, where the
CC functional polypeptide domain corresponds to a soluble loop of an
CC integral membrane protein (e.g. human CCR5, a transmembrane receptor
CC involved in HIV (human immunodeficiency virus) infection). Also included
CC are; (1) a polypeptide comprising a scaffold domain; (2) a nucleic acid
CC encoding the fusion polypeptide; (3) a vector cassette for the expression
CC of the fusion polypeptide comprising an expression region operably linked
CC to a promoter, where the expression region comprises a number of
CC cassettes, each of which encodes a module, domain or strand of the fusion
CC polypeptide and (4) a host cell comprising the vector or nucleic acid.
CC The fusion polypeptide is useful for screening molecules that
CC bind/activate/inhibit/modulate the fusion polypeptide, by expressing the
CC fusion polypeptide from and identifying a molecule that binds to the
CC fusion polypeptide. The fusion polypeptide is useful in diagnostic
CC methods, in assays to identify compounds that interact with loops of
CC fragments of an extracellular domain (ECD) or an intracellular domain
CC (ICD) or to rapidly assay the function of mutated portions of mutant
CC integral membrane proteins without having to produce significant
CC quantities of the entire mutant integral membrane protein, to generate
CC antibodies that recognise the integral membrane proteins from which they
CC are designed, to competitively bind the ligand of a naturally occurring
CC receptor in vitro or in vivo, to display and/or screen soluble domains
CC from protein such as integral membrane proteins, to probe the structure
CC of ECD or ICD, or both, of an integral protein membrane, to modulate the
CC activity of a receptor in vivo, and for treating or preventing viral
CC infection, preferably human HIV infection e.g. by gene therapy using the
CC encoding nucleic acid. The present sequence is a scaffolded protein based
CC on the ECD region of human CCR5 (not defined)
XX
SQ Sequence 138 AA;


Query Match 100.0%; Score 797; DB 5; Length 138;
Best Local Similarity 100.0%; Pred. No. 1.8e-75;
Matches 138; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHS 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHS 60

Qy  61 YKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGL 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 YKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGL 120

Qy 121 NNCSSSNRLDGHQRVHAA 138
       ||||||||||||||||||
Db 121 NNCSSSNRLDGHQRVHAA 138

RESULT 2 
ABG32540 

ID ABG32540 standard; protein; 157 AA. 
XX 

AC ABG32540; 
XX 

DT 15-NOV-2002 (first entry) 
XX 

DE Human CCR5-based scaffolded fusion protein #2 
XX 
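KW Scaffolded protein; CCR5; HIV; human immunodeficiency virus infection;
KW ECD; extracellular domain; metal chelating motif; zinc finger protein;
KW integral membrane protein; soluble loop; intracellular domain; ICD;
KW gene therapy; immunogen; viral infection; human.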
XX 

OS Homo sapiens . 
OS Synthetic. 
XX 

FH Key Location/Qualifiers
FT Peptide 1..19
FT /label= Signal_peptide
FT Protein 20..157
FT /label= Mature scaffolded protein

XX 

PN WO200260477-A1. 
XX 

PD 08-AUG-2002. 
XX 

PF 29-JAN-2002; 2002WO-US002377 . 
XX 

PR 31-JAN-2001; 2001US-0265782P .
PR 31-JAN-2001; 2001US-0265858P . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Coleman TA, Mansfield B; 
XX 

DR WPI; 2002-643357/69. 
XX 

PT Novel scaffolded fusion polypeptide useful for therapeutic purposes or
PT for screening molecules that bind/activate/inhibit/modulate the
PT polypeptide, comprises a functional polypeptide domain fused to a
PT scaffold domain.

XX 

PS Example 2; Page 41; 64pp; English. 
XX 

CC The invention relates to a scaffolded fusion polypeptide comprising a
CC functional polypeptide domain fused to a scaffold domain, where the
CC functional polypeptide domain corresponds to a soluble loop of an
CC integral membrane protein (e.g. human CCR5, a transmembrane receptor
CC involved in HIV (human immunodeficiency virus) infection). Also included
CC are; (1) a polypeptide comprising a scaffold domain; (2) a nucleic acid
CC encoding the fusion polypeptide; (3) a vector cassette for the expression
CC of the fusion polypeptide comprising an expression region operably linked
CC to a promoter, where the expression region comprises a number of
CC cassettes each of which encodes a module, domain or strand of the fusion
CC polypeptide and (4) a host cell comprising the vector or nucleic acid.
CC The fusion polypeptide is useful for screening molecules that
CC bind/activate/inhibit/modulate the fusion polypeptide, by expressing the
CC fusion polypeptide from and identifying a molecule that binds to the
CC fusion polypeptide. The fusion polypeptide is useful in diagnostic
CC methods, in assays to identify compounds that interact with loops of
CC fragments of an extracellular domain (ECD) or an intracellular domain
CC (ICD) or to rapidly assay the function of mutated portions of mutant
CC integral membrane proteins without having to produce significant
CC quantities of the entire mutant integral membrane protein, to generate
CC antibodies that recognise the integral membrane proteins from which they
CC are designed, to competitively bind the ligand of a naturally occurring
CC receptor in vitro or in vivo, to display and/or screen soluble domains
CC from protein such as integral membrane proteins, to probe the structure
CC of ECD or ICD, or both, of an integral protein membrane, to modulate the
CC activity of a receptor in vivo, and for treating or preventing viral
CC infection, preferably human HIV infection e.g. by gene therapy using the
CC encoding nucleic acid. The present sequence is a scaffolded protein based
CC on the ECD region of human CCR5 (not defined)

XX 

SQ Sequence 157 AA; 

Query Match 100.0%; Score 797; DB 5; Length 157;
Best Local Similarity 100.0%; Pred. No. 2.1e-75;
Matches 138; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHS 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  20 MDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHS 79

Qy  61 YKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGL 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  80 YKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGL 139

Qy 121 NNCSSSNRLDGHQRVHAA 138
       ||||||||||||||||||
Db 140 NNCSSSNRLDGHQRVHAA 157

RESULT 3 
AAW27407 
ID AAW27407 standard; protein; 352 AA.
XX
AC AAW27407;
XX
DT 14-APR-1998 (first entry)
XX
DE Human CCR5.
XX
KW Human Cys-Cys chemokine receptor 5; CCR5; human immunodeficiency virus;
KW type 1; type 2; HIV-1; HIV-2; diagnosis; treatment; prevention;
KW inflammatory disease; rheumatoid arthritis; glomerulonephritis; asthma;
KW idiopathic pulmonary fibrosis; psoriasis; viral infection; cancer;
KW atherosclerosis; autoimmune disorder.
XX
OS Homo sapiens.
XX
PN WO9732019-A2.
XX
PD 04-SEP-1997.
XX
PF 28-FEB-1997; 97WO-BE000023 .
XX
PR 01-MAR-1996; 96EP-00870021.
PR 06-AUG-1996; 96EP-00870102.
XX
PA (EURO-) EUROSCREEN SA.
XX
PI Samson M, Parmentier M, Vassart G, Libert F;
XX
DR WPI; 1997-479829/44.
DR N-PSDB; AAT90117.
XX
PT Active and inactive forms of human CC chemokine receptor CCR-5 - useful
PT to diagnose, prevent and/or treat inflammatory disorders, autoimmune
PT disease and viral infection.
XX
PS Claim 4; Fig 1b-c; 94pp; English.
XX
CC The present sequence is human CC (Cys-Cys) chemokine receptor 5 (CCR5)
CC which is stimulated by MIP-1 alpha, MIP-1 beta or RANTES chemokines, but
CC not by monocyte chemoattractant protein 1 (MCP-1), MCP-2, MCP-3,
CC interleukin-8 (IL-8) or growth related gene product alpha (GRO alpha)
CC chemokines. Active CCR-5 is also a receptor of human immunodeficiency
CC virus type 1 or type 2 (HIV-1 or HIV-2). CCR5 or its cDNA can be used to
CC diagnose, treat and/or prevent inflammatory diseases, e.g. rheumatoid
CC arthritis, glomerulonephritis, asthma, idiopathic pulmonary fibrosis and
CC psoriasis, viral infections, especially HIV-1 or HIV-2 infection, cancer,
CC atherosclerosis and autoimmune disorders
XX
SQ Sequence 352 AA;


Query Match 47.6%; Score 379; DB 2; Length 352;
Best Local Similarity 35.0%; Pred. No. 3.2e-31;
Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7;

Qy   1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30
       ||||||||||||||||||||||||||||||
Db   1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60

Qy  31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64
Db  61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy  65 LCTRSQKEGLHYTC 78
Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178

Qy  79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105
Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135
Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281

RESULT 4 
AAW27123 

ID AAW27123 standard; protein; 352 AA. 


XX
AC AAW27123;
XX
DT 14-DEC-1997 (first entry)
XX
DE Human chemokine receptor 88C.
XX
KW Chemokine receptor 88C; atherosclerosis; rheumatoid arthritis; tumour;
KW asthma; viral infection; AIDS; inflammation; autoimmune disease; therapy;
KW diagnosis; leukocyte trafficking; G protein coupled receptor; ligand;
KW modulator; antibody; human.
XX
OS Homo sapiens.
XX
FH Key Location/Qualifiers
FT Domain 1..32
FT /label= Extracellular_domain
FT Domain 56..67
FT /label= Intracellular_domain
FT Domain 89..112
FT /label= Extracellular_domain
FT Domain 125..145
FT /label= Intracellular_domain
FT Domain 166..191
FT /label= Extracellular_domain
FT Domain 213..235
FT /label= Intracellular_domain
FT Domain 259..280
FT /label= Extracellular_domain
FT Domain 301..352
FT /label= Intracellular_domain
XX
PN WO9722698-A2.
XX
PD 26-JUN-1997.
XX


PF 20-DEC-1996; 96WO-US020759 . 
XX 

PR 20-DEC-1995; 95US-00575967 . 

PR 07-JUN-1996; 96US-00661393 . 
XX 

PA (ICOS-) ICOS CORP. 
XX 

PI Gray PW, Schweickart VL, Raport CJ; 
XX 

DR WPI; 1997-341689/31. 

DR N-PSDB; AAT85161. 
XX 
PT New nucleic acid encoding chemokine receptors 88-2B and 88C - used to
PT [illegible] leukocyte trafficking, e.g. for treatment of inflammation,
PT tumours, viral infections, auto-immune diseases, etc.


XX 

PS Claim 16; Page 47-48; 65pp; English. 
XX 

CC This polypeptide sequence comprises novel human chemokine receptor 88C, a
CC G protein coupled receptor that is involved in leukocyte trafficking. Its
CC amino acid sequence was deduced from a cDNA clone (AAT85161) isolated from a
CC macrophage library. It shows 62% identity to CCCKR1. Chemokine receptor
CC 88-2B (see AAW27124) has also been identified. 88C and 88-2B receptors
CC and their polypeptide fragments can be produced in transformed host
CC cells. The receptors, peptides comprising one or more of the
CC extracellular and intracellular domains, and anti-receptor antibodies can
CC be used to modulate receptor activities, particularly ligand and G
CC protein binding, and are potentially useful in the treatment
CC of atherosclerosis, rheumatoid arthritis, tumours, asthma, viral
CC infection, AIDS, inflammatory conditions, pathological immune response,
CC abnormal haematopoietic processes etc.
XX
SQ Sequence 352 AA;


Query Match 47.6%; Score 379; DB 2; Length 352;
Best Local Similarity 35.0%; Pred. No. 3.2e-31;
Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7;

Qy   1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30
       ||||||||||||||||||||||||||||||
Db   1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60

Qy  31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64
Db  61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy  65 LCTRSQKEGLHYTC 78
Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178

Qy  79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105
Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135
Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281


RESULT 5
AAW23835

ID AAW23835 standard; protein; 352 AA.
XX
AC AAW23835;
XX
DT 08-JUN-1998 (first entry)
XX
DE Human CC chemokine receptor 5 (CCR5).
XX
KW CC chemokine receptor 5; CCR5; G-protein coupled receptor;
KW human immunodeficiency virus; HIV; CD4; AIDS; therapy; transgenic animal.
XX
OS Homo sapiens.
XX
FH Key Location/Qualifiers
FT Domain 29..55
FT /label= I
FT /note= "transmembrane domain"
FT Domain 104..126
FT /label= III
FT /note= "transmembrane domain"
FT Region 109..120
FT /note= "extracellular loop-1 (Claim 19)"
FT Domain 143..171
FT /label= IV
FT /note= "transmembrane domain"
FT Region 187..210
FT /note= "extracellular loop-2 (Claim 19)"
FT Domain 194..219
FT /label= V
FT /note= "transmembrane domain"
FT Domain 238..258
FT /label= VI
FT /note= "transmembrane domain"
FT Region 261..276
FT /note= "extracellular loop-3 (Claim 19)"
FT Domain 277..300
FT /label= VII
FT /note= "transmembrane domain"
XX
PN WO9745543-A2.
XX
PD 04-DEC-1997.
XX
PF 28-MAY-1997; 97WO-US009586.
XX
PR 28-MAY-1996; 96US-0018508P.
XX
PA (USSH ) US DEPT HEALTH & HUMAN SERVICES.
XX
PI Combadiere C, Feng Y, Berger EA, Alkhatib G, Murphy PM,
PI Broder CC, Kennedy PE;
XX
DR WPI; 1998-032650/03.


DR N-PSDB; AAT76920. 
XX 

PT CC chemokine receptor 5 polypeptide - used to inhibit membrane fusion 

PT between HIV and a target cell 

XX 

PS Claim 68; Fig 1C; 70pp; English. 
XX 

CC This protein sequence comprises a novel human macrophage-selective CC
CC chemokine receptor that has been designated CCR5. The sequence was
CC [illegible] (see AAT76920). An Ala127Leu variant
CC (see [illegible]) of CCR5 was also identified. The susceptibility of human
CC [illegible] surface expression of CD4
CC and CCR5. CCR5 is a member of the 7-transmembrane superfamily of G-
CC protein coupled cell surface molecules. It plays an essential role in the
CC [illegible] by [illegible] HIV isolates. The establishment
CC of stable non-human cell lines and transgenic mammals having cells that
CC [illegible] CD4 and CCR5 provides valuable tools for research of HIV
CC infection. Antibodies that bind to CCR5, CCR5 variants, and CCR5-binding
CC agents capable of blocking membrane fusion between HIV and target cells
CC represent potential [illegible]
XX


SQ Sequence 352 AA; 

Query Match 47.6%; Score 379; DB 2; Length 352;
Best Local Similarity 35.0%; Pred. No. 3.2e-31;
Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7;

Qy   1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30
       ||||||||||||||||||||||||||||||
Db   1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60

Qy  31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64
Db  61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy  65 LCTRSQKEGLHYTC 78
Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178

Qy  79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105
Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135
Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281


RESULT 6 
AAW88232 

ID AAW88232 standard; protein; 352 AA. 
XX 

AC AAW88232; 
XX 

DT 15-MAR-1999 (first entry) 
XX 


DE HIV-1 co-receptor CCR5 . 
XX 

KW HIV-1; CCR5; CCR5m303; co-receptor; infection; diagnosis; AIDS ; 

KW gene therapy; human. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers
FT Domain 32..56
FT /note= "transmembrane domain 1"
FT Domain 67..87
FT /note= "transmembrane domain 2"
FT Misc-difference 101
FT /note= "corresponds to TGT (Cys) in wild-type CCR5, TGA
FT (Stop) in CCR5m303"
FT Domain 103..124
FT /note= "transmembrane domain 3"
FT Domain 142..167
FT /note= "transmembrane domain 4"
FT Domain 200..223
FT /note= "transmembrane domain 5"
FT Domain 236..260
FT /note= "transmembrane domain 6"
FT Domain 275..301
FT /note= "transmembrane domain 7"
XX 

PN WO9854317-A1.
XX 

PD 03-DEC-1998. 
XX 

PF 29-MAY-1998; 98WO-EP003437 . 
XX 

PR 30-MAY-1997; 97US-0048057P . 
XX 

PA (MOND-) FOND MONDIALE RECH & PREVENTION SIDA. 
XX 

PI Beretta A, Quillent C, Arenzana Siesdedos F, Braun J; 
XX 

DR WPI; 1999-059835/05. 

DR N-PSDB; AAV84126. 
XX 

PT New CCR5 variant protein of the HIV-1 co-receptor - useful in developing 

PT resistance of CCR5-expressing cells to HIV-1 infection.

XX 

PS Disclosure; Page 34-35; 55pp; English. 
XX 

CC This is the amino acid sequence of wild-type human CCR5, which serves as
CC a co-receptor for infection by macrophage-tropic (M-tropic) strains of
CC HIV-1. The invention relates to the identification of a CCR5 variant (see
CC [illegible]), designated CCR5m303, comprising the first two transmembrane
CC domains of wild-type CCR5, but lacking transmembrane domains 3-7. The
CC [illegible] CCR5m303 variant with the wild type CCR5 allele shows a
CC positive correlation with resistance to infection with M-tropic HIV-1
CC strains, and may indicate slower progression of the disease. The
CC [illegible] CCR5 variants may be used to identify individuals at lower
CC risk of infection relative to the general population who, if infected,
CC may exhibit slower progression to AIDS. Probes and primers (see AAV84127-
CC [illegible]) are provided for use in diagnostic methods for detecting the presence
CC of such variants. A method is provided for inhibiting HIV-1 infection of
CC a cell expressing the CCR5 receptor. This involves introducing a nucleic
CC acid encoding a CCR5 variant into the cell, thereby reducing the number
CC of functional CCR5 molecules present on the cell surface


XX 

SQ Sequence 352 AA; 


Query Match 47.6%; Score 379; DB 2; Length 352;
Best Local Similarity 35.0%; Pred. No. 3.2e-31;
Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7;

Qy   1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30
       ||||||||||||||||||||||||||||||
Db   1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60

Qy  31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64
Db  61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy  65 LCTRSQKEGLHYTC 78
Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178

Qy  79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105
Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135
Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281


RESULT 7 
AAE07048 

ID AAE07048 standard; protein; 352 AA. 
XX 

AC AAE07048; 
XX 

DT 16-OCT-2001 (first entry) 
XX 
DE Human G-protein chemokine receptor (CCR5) HDGNR10 protein #2.
XX
KW Human; G-protein chemokine receptor; CCR5; HDGNR10; inflammation; HIV;
KW human immunodeficiency virus; antimicrobial; vasodilator; vulnerary;
KW cytostatic; immunosuppressive; nootropic; neuroprotective; gene therapy;
KW neurodegenerative disorder; Kaposi's sarcoma; autoimmune disease;
KW rheumatoid arthritis; cancer; breast; ovary; adrenal gland; bone marrow;
KW gastrointestinal tract; lung; liver; immune disorder; Addison's disease;
KW haemolytic anaemia; autoimmune thyroiditis; diabetes mellitus; allergy;
KW multiple sclerosis; ulcerative colitis; Crohn's disease; wound healing;
KW cardiovascular disorder; myocardial ischaemia.
XX
OS Homo sapiens.
XX
PN WO200158916-A2.
XX


PD 16-AUG-2001. 
XX 

PF 09-FEB-2001; 2001WO-US004153 . 
XX 

PR 09-FEB-2000; 2000US-0181258P . 

PR 09-MAR-2000; 2000US-0187999P .

PR 22-SEP-2000; 2000US-0234336P . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Rosen CA, Roschke V, Li Y, Ruben SM; 
XX 

DR WPI; 2001-488966/53. 

DR N-PSDB; AAD13299. 
XX 
PT [illegible] nucleic acid encoding a human G-protein chemokine receptor
PT (CCR5) HDGNR10 polypeptide, useful for preventing or treating autoimmune
PT diseases e.g. rheumatoid arthritis, hyperproliferative disorders and
PT neurodegenerative disorders.
XX 

PS Example 40; Page 504-505; 518pp; English.
XX 

CC The invention relates to human G-protein chemokine receptor (CCR5)
CC HDGNR10 polypeptides and polynucleotides. CCR5 HDGNR10 antibodies are
CC useful for treating, preventing or ameliorating a disease or disorder
CC associated with inflammation, defective or aberrant chemotaxis of immune
CC cells, HIV infection (such as Pneumocystis carinii pneumonia or Kaposi's
CC sarcoma) or defective or aberrant T-cell antigen presenting cell
CC interaction. The disease or disorder may also be an infectious disease
CC (e.g. a viral infection such as an early stage HIV infection, a
CC cytomegalovirus infection, or a poxvirus infection), an autoimmune
CC disease (e.g. rheumatoid arthritis) or a neurodegenerative disorder. The
CC disease or disorder may be associated with aberrant CCR5 expression, lack
CC of CCR5 function, aberrant CCR5 ligand expression, or lack of CCR5 ligand
CC function. CCR5 HDGNR10 protein is used as a food additive or preservative
CC to increase or decrease storage capabilities. CCR5 HDGNR10 DNA are useful
CC for chromosome identification and in gene therapy. CCR5 HDGNR10 DNA,
CC protein, antibodies, agonists and antagonists are also useful in the
CC diagnosis, treatment and prevention of cancer (breast, ovary, adrenal
CC gland, bone, bone marrow, gastrointestinal tract, liver, lung,
CC urogenital); immune disorders (Addison's disease, allergies, autoimmune
CC haemolytic anaemia, autoimmune thyroiditis, diabetes mellitus, Crohn's
CC disease, multiple sclerosis, rheumatoid arthritis and ulcerative colitis)
CC ; cardiovascular disorders (myocardial ischaemias) and wound healing. The
CC present sequence is human CCR5 HDGNR10 protein
XX 

SQ Sequence 352 AA; 

Query Match 47.6%; Score 379; DB 4; Length 352;
Best Local Similarity 35.0%; Pred. No. 3.2e-31;
Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7;

Qy   1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30
       ||||||||||||||||||||||||||||||
Db   1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60

Qy  31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64
Db  61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy  65 LCTRSQKEGLHYTC 78
Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178

Qy  79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105
Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135
Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281


RESULT 8
AAG80111

ID AAG80111 standard; protein; 352 AA.
XX
AC AAG80111;
XX
DT 17-JAN-2002 (first entry)
XX
DE Human CCR5 protein.
XX
KW [illegible] diagnosis; colorectal, prostatic; organ rejection;
KW inflammation; autoimmune disease; metastasis; bronchial asthma; lupus;
KW chronic bowel inflammation; rheumatoid arthritis; cytostatic;
KW antiinflammatory; antiasthmatic; immunosuppressive; dermatological;
KW antirheumatic; antiarthritic.
XX 

OS Homo sapiens. 
XX 

PN WO200172830-A2. 
XX 

PD 04-OCT-2001. 
XX 

PF 02-APR-2001; 2001WO-EP003708 . 
XX 

PR 31-MAR-2000; 2000DE-01016013 
XX 

PA (IPFP-) IPF PHARM GMBH. 
PA (FORS/) FORSSMANN U. 
XX 

PI Forssmann W, Adermann K, Heitland A, Spodsberg N; 

XX

DR WPI; 2001-626256/72. 
XX 
PT Diagnostic agent containing two or more receptor-specific ligands useful
PT [illegible] tumors, inflammation etc., also therapeutic use of [illegible]
XX
PS Disclosure; Page 10; 26pp; German.
XX
CC [illegible] describes a novel diagnostic agent (A) comprising at least
CC two different ligands (I) for receptors (II) that are implicated in
CC [illegible] are used for the diagnosis of tumors (especially colorectal
CC [illegible]), organ rejection, inflammation and autoimmune diseases.
CC Inhibitors of (I) are used therapeutically against tumors (and their
CC [illegible]), inflammation (particularly bronchial asthma or chronic bowel
CC [illegible]) or autoimmune diseases (rheumatoid arthritis or lupus),
CC where the [illegible] vascular, lymphatic, respiratory, nervous, digestive,
CC exocrine,
CC endocrine, motor or urogenital systems or skin are affected, and bone
CC [illegible]. The products of the invention [illegible] chemokine derivatives
CC [illegible] cytostatic, antiinflammatory, antiasthmatic,
CC immunosuppressive, dermatological, antirheumatic, antiarthritic
XX

SQ Sequence 352 AA; 

Query Match 47.6%; Score 379; DB 4; Length 352;
Best Local Similarity 35.0%; Pred. No. 3.2e-31;
Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7;

Qy   1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30
       ||||||||||||||||||||||||||||||
Db   1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60

Qy  31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64
Db  61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy  65 LCTRSQKEGLHYTC 78
Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178

Qy  79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105
Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135
Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281

RESULT 9 
AAE04321 

ID AAE04321 standard; protein; 352 AA. 
XX 

AC AAE04321;
XX 

DT 04-SEP-2001 (first entry) 
XX 

DE Human chemokine receptor (CKR), CC-CKR-5 related protein #2.
XX
KW Human; transformed mammalian cell; CD4; reporter gene; translocation;
KW human [illegible];
KW CC-CKR-5; envelope glycoprotein; anti-HIV.

XX 


OS Homo sapiens . 
XX 

PN US6258527-B1. 
XX 

PD 10-JUL-2001. 
XX 

PF 21-MAY-1997; 97US-00861105 .
XX 

PR 20-MAY-1996; 96US-0017157P . 
PR 19-JUN-1996; 96US-0020043P . 
PR 19-MAY-1997; 97US-00858660 . 
XX 

PA (AARO-) AARON DIAMOND AIDS RES CENT. 

PA (UYNY ) UNIV NEW YORK STATE 

XX 

PI Littman DR, Deng H, Ellmeier W, Landau NR, Liu R;
XX 

DR WPI; 2001-417127/44. 
DR N-PSDB; AAD08577. 
XX 
PT Transformed mammalian cell (I) that contains a CD4 gene, reporter gene
PT and HIV LTR for identification of drugs and antibodies for treatment of
PT HIV
XX
PS Disclosure; Col 47-50; 37pp; English.
XX

CC The present invention relates to a transformed mammalian cell that
CC [illegible] construct encoding a reporter gene under
CC [illegible] immunodeficiency virus (HIV) long terminal
CC repeat (LTR) and that has been transduced with a vector encoding a human
CC chemokine receptor (CKR) where the CD4 and the CKR are present on the
CC [illegible] transformed mammalian cell. The invention is useful for
CC [illegible] antibodies that interfere with the translocation of
CC HIV into [illegible] identifying a human chemokine
CC receptor that facilitates the infection of a particular HIV strain into
CC the transformed mammalian cell. Compounds identified can be used to treat
CC cellular dysfunction and to prevent or combat HIV infection. The present
CC sequence is a human chemokine receptor (CKR), CC-CKR-5 related protein.
CC CC-CKR-5 is the principal cofactor for entry mediated by the envelope
CC glycoproteins of primary macrophage-tropic strains of HIV-1
XX
SQ Sequence 352 AA;


Query Match 47.6%; Score 379; DB 4; Length 352;
Best Local Similarity 35.0%; Pred. No. 3.2e-31;
Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7;

Qy   1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30
       ||||||||||||||||||||||||||||||
Db   1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60

Qy  31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64
Db  61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy  65 LCTRSQKEGLHYTC 78
Db 119 IILLTIDRYIAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178

Qy  79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105
Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135
Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281


RESULT 10 
AAE07039 

ID AAE07039 standard; protein; 352 AA. 
XX 

AC AAE07039; 
XX 

DT 16-OCT-2001 (first entry) 
XX 
DE Human G-protein chemokine receptor (CCR5) HDGNR10 protein #2.
XX
KW Human; G-protein chemokine receptor; CCR5; HDGNR10; inflammation; HIV;
KW human immunodeficiency virus; antimicrobial; vasodilator; vulnerary;
KW cytostatic; immunosuppressive; nootropic; neuroprotective; gene therapy;
KW neurodegenerative disorder; Kaposi's sarcoma; autoimmune disease;
KW rheumatoid arthritis; cancer; breast; ovary; adrenal gland; bone marrow;
KW gastrointestinal tract; lung; liver; immune disorder; Addison's disease;
KW haemolytic anaemia; autoimmune thyroiditis; diabetes mellitus; allergy;
KW multiple sclerosis; ulcerative colitis; Crohn's disease; wound healing;
KW cardiovascular disorder; myocardial ischaemia.
XX

OS Homo sapiens. 
XX 

PN WO200158915-A2. 
XX 

PD 16-AUG-2001. 
XX 

PF 09-FEB-2001; 2001WO-US004152.
XX

PR 09-FEB-2000; 2000US-0181258P.
PR 09-MAR-2000; 2000US-0187999P.
PR 22-SEP-2000; 2000US-0234336P.
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Rosen CA, Roschke V, Li Y, Ruben SM; 
XX 

DR WPI; 2001-488965/53. 
DR N-PSDB; AAD13198. 
XX 
PT Nucleic acid encoding a human G-protein chemokine receptor
PT (CCR5) polypeptide, useful for preventing or treating autoimmune
PT diseases e.g. rheumatoid arthritis, hyperproliferative disorders and
PT neurodegenerative disorders.
XX


PS Example 40; Page 486-487; 495pp; English. 
XX 


CC [illegible] chemokine receptor (CCR5, HDGNR10)
CC [illegible] HDGNR10 antibodies are useful for treating, preventing or
CC [illegible] a disease or disorder associated with inflammation,
CC [illegible] chemotaxis of immune cells, HIV infection (such as
CC [illegible] pneumonia or Kaposi's sarcoma) or defective or
CC aberrant T-cell antigen presenting cell interaction. The disease or
CC disorder may also be an infectious disease (e.g. a viral infection such
CC as early stage HIV infection, a cytomegalovirus infection or a
CC [illegible] (e.g. rheumatoid arthritis) or
CC [illegible] disorder may be associated
CC with aberrant CCR5 expression, lack of CCR5 function, aberrant CCR5
CC ligand expression, or lack of CCR5 ligand function. CCR5 HDGNR10 protein
CC [illegible] additive to increase or decrease
CC [illegible] CCR5 HDGNR10 DNA are useful for chromosome
CC identification and in gene therapy. CCR5 HDGNR10 DNA, protein
CC [illegible] agonists and antagonists are also useful in the diagnosis,
CC treatment and prevention of cancer (breast, ovary, adrenal gland, bone
CC [illegible] gastrointestinal tract, liver, urogenital); immune
CC [illegible] (Addison's disease, allergies, autoimmune haemolytic anaemia,
CC autoimmune thyroiditis, diabetes mellitus, Crohn's disease, multiple
CC [illegible] rheumatoid arthritis and ulcerative colitis); cardiovascular
CC disorders (myocardial ischaemias) and wound healing

SQ Sequence 352 AA; 

Query Match 47.6%; Score 379; DB 4; Length 352;
Best Local Similarity 35.0%; Pred. No. 3.2e-31;
Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7;

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30
     ||||||||||||||||||||||||||||||
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64
      ' I I I I I I I I I I I I : I : : |
Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78
      ' I I I I I I I I II I I
Db 119 [illegible]IFTRSQKEGLHYTC 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105
Db 179 [illegible]

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135
       I I I M I I I I I M | M I I I : |
Db 239 [illegible]

RESULT 11 
AAB46858 

ID AAB46858 standard; protein; 352 AA. 
XX 

AC AAB46858; 
XX 

DT 16-AUG-2001 (revised) 


DT 02-AUG-2001 (revised) 

DT 04-MAY-2001 (first entry) 

XX 

DE Human HDGNR10 protein. 
XX 
KW HDGNR10; human; G-protein chemokine receptor; antiinflammatory;
KW immunomodulatory; anticoagulant; antiallergic; immunosuppressive;
KW cytostatic; antiparasitic; antipsoriatic; antirheumatic; antiarthritic;
KW vasotropic; gene therapy; haematopoiesis; wound healing; coagulation;
KW [illegible]; solid tumour; infection; leukemia; growth factor activity;
KW T cell mediated autoimmune disease; psoriasis; allergy; atherogenesis;
KW anaphylaxis; malignancy; inflammation; histamine; IgE; silicosis; shock;
KW immunoglobulin E-mediated allergic reaction; rheumatoid arthritis;
KW prostaglandin-independent fever; bone marrow failure; sarcoidosis;
KW hyper-eosinophilic syndrome; vulnerary.
XX 

OS Homo sapiens. 
XX 

PN US2001000241-A1. 
XX 

PD 12-APR-2001. 
XX 

PF 29-NOV-2000; 2000US-00725285.
XX

PR 06-JUN-1995; 95US-00466343.
PR 18-NOV-1998; 98US-00195662.
PR 25-JUN-1999; 99US-00339912.
XX 

PA (LIYY/) LI Y. 
PA (RUBE/) RUBEN S M. 
XX 

PI Li Y, Ruben SM; 
XX 

DR WPI; 2001-226317/23. 
DR N-PSDB; AAF26390. 
XX 
PT G-protein chemokine receptor polypeptides and polynucleotides,
PT useful for identifying (ant)agonists to the G-protein chemokine receptor.
XX

PS Claim 1a; Page 15; 22pp; English.
XX

CC [illegible] a novel receptor polypeptide (I) selected from
CC [illegible] amino acid sequence [illegible] fully disclosed in the
CC [illegible] a polypeptide encoded by the cDNA contained in a
CC plasmid, and fragments, analogs and derivatives of the polypeptide. The
CC products of the invention have antiinflammatory, immunomodulatory,
CC anticoagulant, antiallergic, immunosuppressive, vulnerary, cytostatic,
CC antiparasitic, antipsoriatic, antirheumatic, antiarthritic and vasotropic
CC activity and can be used for gene therapy. The G-protein chemokine
CC receptor, HDGNR10, (I) is useful for screening for compounds which
CC [illegible] or inhibit activation of (I). The products of the invention can
CC be used for stimulating haematopoiesis, wound healing, coagulation,
CC angiogenesis, treating solid tumours, chronic infections, leukemia, T-
CC cell mediated autoimmune diseases, parasitic infections, psoriasis, and
CC stimulating growth factor activity. HDGNR10 is useful for treating
CC allergy, atherogenesis, anaphylaxis, malignancy, chronic and acute
CC inflammation, histamine and immunoglobulin E (IgE)-mediated allergic
CC [illegible] prostaglandin-independent fever, bone marrow failure,
CC [illegible] sarcoidosis, rheumatoid arthritis, shock and hyper-
CC eosinophilic syndrome. (N.B. This record was resubmitted to correct
CC errors in the keyword formatting)
XX

SQ Sequence 352 AA;

Query Match 47.6%; Score 379; DB 4; Length 352;
Best Local Similarity 35.0%; Pred. No. 3.2e-31;
Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7;

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30
     ||||||||||||||||||||||||||||||
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64
Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78
Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105
       I I I I I I I I I I I I I I I I I I I |
Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135
Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281

RESULT 12 
ABB56342 

ID ABB56342 standard; protein; 352 AA. 
XX 

AC ABB56342;
XX 

DT 18-FEB-2002 (first entry) 
XX 
DE Non-endogenous human GPCR protein, SEQ ID NO: 477.
XX
KW Human; G protein-coupled receptor; GPCR; non-endogenous; mutant;
KW constitutively activated GPCR; agonist; disease.
XX

OS Homo sapiens.
OS Synthetic.
XX

PN WO200177172-A2.
XX

PD 18-OCT-2001.
XX

PF 05-APR-2001; 2001WO-US011098.
XX

PR 07-APR-2000; 2000US-0195747P.
XX

PA (AREN-) ARENA PHARM INC. 


XX 

PI Lehmann-Bruinsma K, Liaw CW, Lin I; 
XX 

DR WPI; 2001-648759/74. 
DR N-PSDB; ABI97978. 
XX 

PT [illegible] G protein-coupled receptors (GPCRs) for use in
PT [illegible] comprises contacting candidate compounds with versions

XX 

PS Claim 1; Page 277-278; 394pp; English. 
XX 

CC The invention relates to G protein-coupled receptors (GPCRs) for which
CC the endogenous ligand has been identified. Non-endogenous constitutively
CC activated versions of known GPCRs are used in the invention for the
CC direct identification of candidate compounds as receptor agonists,
CC inverse agonists or partial agonists. Such agonists are useful as
CC therapeutic agents for diseases or disorders associated with GPCRs. The
CC present sequence is a non-endogenous version of a known human GPCR.
XX

SQ Sequence 352 AA;


Query Match 47.6%; Score 379; DB 4; Length 352;
Best Local Similarity 35.0%; Pred. No. 3.2e-31;
Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7;

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30
     ||||||||||||||||||||||||||||||
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64
      ' ' II I I I I I I I I I I I • I • -I
Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78
Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105
      M I II II I II I I I I I I M I
Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAKRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135
       M I I II I I I
Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281

RESULT 13 
AAB83354 

ID AAB83354 standard; protein; 352 AA. 
XX 

AC AAB83354; 
XX 

DT 09-OCT-2001 (first entry) 
XX 

DE Human CCR5 protein sequence 
XX 


KW Chemotactic chemokine receptor 5; gp120; CD4; therapy; HIV; CCR5;
KW human immunodeficiency virus; anti-inflammatory disease; human.
XX

OS Homo sapiens. 
XX 

PN EP1118858-A2 . 
XX 

PD 25-JUL-2001. 
XX 

PF 03-JAN-2001; 2001EP-00300020 . 
XX 

PR 12-JAN-2000; 2000GB-00000659 . 

PR 12-JAN-2000; 2000GB-00000661 . 

PR 12-JAN-2000; 2000GB-00000663 . 
XX 

PA (PFIZ ) PFIZER LTD. 

PA (PFIZ ) PFIZER INC.
XX 

PI Dobbs S, Perros M, Rickett GA; 
XX 

DR WPI; 2001-477088/52. 

DR N-PSDB; AAF87099. 
XX 
PT 
PT 


XX

PS Claim 1; Page 110; 113pp; English.
XX

CC This sequence represents the human CCR5 protein sequence. The invention
CC relates to a method for determining whether an agent is capable of
CC modulating the interaction of chemotactic chemokine receptor 5 (CCR5)
CC with gp120, comprising incubating the agent with CCR5 and gp120 and
CC determining whether the agent modulates the interaction, where gp120 is
CC [illegible] interaction is a low affinity binding.
CC The method is used to identify an agent capable of modulating the
CC interaction of CCR5 with gp120. An agent identified by the method is used
CC to prepare a pharmaceutical composition for the treatment of a disease or
CC condition associated with CCR5 and gp120 interaction, to treat a subject
CC with a disease or condition associated with CCR5 and gp120 interaction
CC [illegible] a pharmaceutical for treating human immunodeficiency
CC virus (HIV); it can also be used to treat anti-inflammatory diseases. The
CC method is commercially useful, amenable to high throughput screening, and
CC detects interaction of gp120 with cells expressing only CCR5


Query Match 47.6%; Score 379; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.2e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30
     ||||||||||||||||||||||||||||||
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64
      1 I I I I I I I I I I I I I I : | : : |
Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78
Db 119 IILLTIDRYLAVVHAVFAL[illegible] 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105
      I I I I I I I I I I I | | | I I I I I I
Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135
Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281


RESULT 14
AAB82948

ID AAB82948 standard; protein; 352 AA.
XX

AC AAB82948;
XX

DT 21-DEC-2001 (first entry)
XX
DE Human HIV-1 co-receptor CCR5.
XX
KW CCR5; chemokine; co-receptor; human immunodeficiency virus type 1; HIV-1;
KW infection; therapy; vaccine; anti-HIV-1.
XX

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers
FT Binding-site 2..18
FT /note= "binds to HIV-1 gp120"

XX 

PN WO200164710-A2. 
XX 

PD 07-SEP-2001. 
XX 

PF 28-FEB-2001; 2001WO-US006699 . 
XX 

PR 29-FEB-2000; 2000US-0185667P . 
PR 19-MAY-2000; 2000US-0205839P . 
PR 07-FEB-2001; 2001US-0267231P . 
XX 

PA (PROG-) PROGENICS PHARM INC. 

PA (AARO-) AARON DIAMOND AIDS RES CENT 

XX 

PI Dragic T, Olson WC; 
XX 

DR WPI; 2001-611273/70. 
DR N-PSDB; AAH26903. 
XX 
PT Novel compounds comprising specific amino acids within CCR5 (HIV 1
PT co-receptor) amino terminal domain including negatively charged and two
PT [illegible] tyrosine residues is useful for treating HIV infection in
PT humans.
XX

PS Claim 1; Page 30; 163pp; English.
XX

CC The present sequence is that of human HIV-1 co-receptor CCR5. Amino acids
CC 2-18 in the N-terminal region of CCR5 comprise an HIV-1 gp120-binding
CC [illegible] specificity of interaction [illegible]
CC CCR5. Post-translational sulfation of the tyrosine residues in the
CC [illegible] and may [illegible] modulate
CC the susceptibility of target cells to HIV-1 infection in vivo. The
CC [illegible] sulfated peptides (see [illegible]2947) that are
CC based on the CCR5 N-terminal region and which are effective for
CC inhibiting HIV-1 binding to CCR5. These peptides are used in claimed
CC methods of inhibiting HIV infection of CD4+ cells, of preventing CD4+
CC cells from becoming infected with HIV, of treating a subject whose CD4+
CC cells are infected with HIV, and of identifying an agent which inhibits
CC binding of a CCR5 ligand to a CCR5 receptor. The methods may be carried
CC [illegible]
CC [illegible] method), or in a subject who is not
CC infected with, but has been exposed to, HIV
XX

SQ Sequence 352 AA;


Query Match 47.6%; Score 379; DB 4; Length 352;
Best Local Similarity 35.0%; Pred. No. 3.2e-31;
Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7;

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30
     ||||||||||||||||||||||||||||||
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64
Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78
Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105
      1 1 1 1 1 1 1 1 1 1 1 1 1 1 iTi 1 1 1
Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135
       I I I I I I I I I I I I I I I I | | : |
Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281


RESULT 15 
AAU97152 

ID AAU97152 standard; protein; 352 AA. 
XX 

AC AAU97152; 
XX 

DT 13-AUG-2002 (first entry) 
XX 
DE Human G-protein chemokine receptor (CCR5) HDGNR10 #2.
XX
KW Human; G-protein chemokine receptor; CCR5; HDGNR10; inflammation;
KW immune cell chemotaxis; autoimmune disease; rheumatoid arthritis;
KW neurodegeneration; viral infection; Kaposi sarcoma; cancer;
KW hyperproliferative disease; neurological disease; receptor.
XX 

OS Homo sapiens. 
XX 

PN US2002048786-A1. 
XX 

PD 25-APR-2002. 
XX 

PF 09-FEB-2001; 2001US-00779879 . 
XX 

PR 09-FEB-2000; 2000US-0181258P.
PR 09-MAR-2000; 2000US-0187999P.
PR 22-SEP-2000; 2000US-0234336P.
XX 

PA (ROSE/) ROSEN C A. 

PA (ROSC/) ROSCHKE V. 

PA (LIYY/) LI Y. 

PA (RUBE/) RUBEN S M. 

XX 

PI Rosen CA, Roschke V, Li Y, Ruben SM; 
XX 

DR WPI; 2002-434754/46. 

DR N-PSDB; ABK51870. 
XX 

PT Nucleic acid encoding an antibody specific for the G-protein
PT [illegible] CCR5, useful for treatment and diagnosis of e.g.
XX 

PS Disclosure; Page 165-166; 180pp; English. 
XX 

CC The present invention relates to the isolation of a novel human G-protein
CC chemokine receptor (CCR5) designated HDGNR10, and polynucleotide
CC sequences encoding it. The invention also describes antibodies that bind
CC human G-protein chemokine receptor (CCR5) HDGNR10 and polynucleotide
CC sequences encoding the antibodies. The antibodies are useful for treating
CC [illegible] defective chemotaxis of immune
CC cells and T-cell/antigen-presenting cell interactions, infections and
CC [illegible] diseases, rheumatoid arthritis, neurodegeneration, viral
CC infection (especially early-stage human immune deficiency virus (HIV),
CC [illegible] Kaposi sarcoma, or conditions
CC associated with aberrant or deficient expression of the CCR5 receptor or
CC its ligands. The antibodies are also useful to determine CCR5 expression
CC e.g. for diagnosis, prognosis and monitoring of cancer and other
CC [illegible] polynucleotide sequences encoding human
CC G-protein chemokine receptor (CCR5) HDGNR10 can be used to produce the
CC [illegible] treatment of a wide range of diseases
CC [illegible] (e.g. [illegible]), neurological diseases (e.g.
CC Parkinson's disease), and hyperproliferative disorders (e.g. cancer). The
CC present sequence represents human G-protein chemokine receptor (CCR5)
CC HDGNR10 #2
XX 

SQ Sequence 352 AA; 


Query Match 47.6%; Score 379; DB 5; Length 352;
Best Local Similarity 35.0%; Pred. No. 3.2e-31;
Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7;

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30
     ||||||||||||||||||||||||||||||
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64
Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78
Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105
      I M I I I I I I I I | | | | | | | | |
Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135
Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281


Search completed: September 17, 2004, 16:45:08 
Job time : 83.461 secs


GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 


OM protein - protein search, using sw model

Run on:          September 17, 2004, 16:42:06 ; Search time 21.5186 Seconds
                 (without alignments)
                 331.080 Million cell updates/sec

Title:           US-10-057-890A-10
Perfect score:   797
Sequence:        1 MDYQVSSPIYDINYYTSEPC......GLNNCSSSNRLDGHQRVHAA 138

Scoring table:   BLOSUM62
                 Gapop 10.0 , Gapext 0.5

Searched:        389414 seqs, 51625971 residues

Total number of hits satisfying chosen parameters:      389414

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :       Issued_Patents_AA:*
                 /cgn2_6/ptodata/2/iaa/5A_COMB.pep:*
                 /cgn2_6/ptodata/2/iaa/5B_COMB.pep:*
                 /cgn2_6/ptodata/2/iaa/6A_COMB.pep:*
                 /cgn2_6/ptodata/2/iaa/6B_COMB.pep:*
                 /cgn2_6/ptodata/2/iaa/PCTUS_COMB.pep:*
                 /cgn2_6/ptodata/2/iaa/backfiles1.pep:*


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
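
The derived figures in this report can be cross-checked against one another. The sketch below is a minimal Python illustration, not GenCore's own code. It assumes, based only on the numbers printed in this report, that Query Match is Score divided by Perfect score, that Best Local Similarity is Matches divided by the total of Matches, Conservative, Mismatches and Indels, and that the cell-update rate is query length times database residues divided by search time. The function names are hypothetical. Pred. No. is not reproduced here because it depends on the full score distribution, which the report does not list.

```python
def query_match_pct(score: float, perfect_score: float) -> float:
    """Score as a percentage of the perfect score (assumed definition)."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity_pct(matches: int, conservative: int,
                              mismatches: int, indels: int) -> float:
    """Identical columns as a percentage of all alignment columns (assumed)."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


def million_cell_updates_per_sec(query_len: int, db_residues: int,
                                 seconds: float) -> float:
    """Dynamic-programming cells filled per second, in millions (assumed)."""
    return query_len * db_residues / seconds / 1e6


if __name__ == "__main__":
    # Values taken from this report: Perfect score 797, top hit Score 379.
    print(query_match_pct(379, 797))                   # 47.6
    # Matches 99; Conservative 7; Mismatches 27; Indels 150.
    print(best_local_similarity_pct(99, 7, 27, 150))   # 35.0
    # 138-residue query, 51625971 residues searched in 21.5186 seconds.
    print(f"{million_cell_updates_per_sec(138, 51625971, 21.5186):.3f}")  # 331.080
```

Under these assumptions the computed values agree with the printed Query Match, Best Local Similarity and cell-update figures, which is a quick way to spot OCR damage in a statistics line.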

SUMMARIES 


Result          Query
   No.   Score  Match  Length  DB  ID                   Description

     1     379   47.6     352   3  US-09-087-232A-13    Sequence 13, Appl
     2     379   47.6     352   3  US-08-861-105-14     Sequence 14, Appl
     3     379   47.6     352   3  US-08-575-967A-2     Sequence 2, Appli
     4     379   47.6     352   4  US-08-833-752-5      Sequence 5, Appli
     5     379   47.6     352   4  US-09-502-783A-2     Sequence 2, Appli
     6     379   47.6     352   4  US-09-796-202-1      Sequence 1, Appli
     7     374   46.9     352   3  US-09-045-583-52     Sequence 52, Appl
     8     374   46.9     352   4  US-09-534-185-52     Sequence 52, Appl
     9     373   46.8     352   3  US-08-466-343D-2     Sequence 2, Appli
    10     359   45.0     352   4  US-09-517-605-5      Sequence 5, Appli
    11     272   34.1     354   4  US-08-724-984A-2     Sequence 2, Appli
    12     258   32.4     184   4  US-08-833-752-4      Sequence 4, Appli
    13     258   32.4     215   3  US-09-087-232A-17    Sequence 17, Appl
    14     258   32.4     215   4  US-08-833-752-6      Sequence 6, Appli
    15   182.5   22.9     100   3  US-09-087-232A-15    Sequence 15, Appl
    16     159   19.9      87   3  US-09-087-232A-18    Sequence 18, Appl
    17   148.5   18.6     488   2  US-08-933-750C-17    Sequence 17, Appl
    18   148.5   18.6     488   3  US-09-234-613-17     Sequence 17, Appl
    19     128   16.1      54   4  US-08-833-752-11     Sequence 11, Appl
    20     128   16.1     313   4  US-09-800-729-196    Sequence 196, App
    21     128   16.1     326   4  US-09-800-729-195    Sequence 195, App
    22   123.5   15.5     717   4  US-09-881-578A-2     Sequence 2, Appli
    23     123   15.4      22   3  US-08-861-105-4      Sequence 4, Appli
    24     121   15.2     518   4  US-09-881-578A-4     Sequence 4, Appli
    25     121   15.2     711   2  US-08-820-170A-10    Sequence 10, Appl
    26     121   15.2     711   3  US-09-055-699-10     Sequence 10, Appl
    27     121   15.2     711   3  US-09-273-565-10     Sequence 10, Appl
    28     121   15.2     711   4  US-09-565-538-10     Sequence 10, Appl
    29     121   15.2     711   4  US-09-661-468-10     Sequence 10, Appl
    30     121   15.2     711   4  US-09-976-165-10     Sequence 10, Appl
    31   120.5   15.1     185   4  US-09-494-190-125    Sequence 125, App
    32   119.5   15.0     185   4  US-09-494-190-126    Sequence 126, App
    33   118.5   14.9     315   1  US-08-253-155A-34    Sequence 34, Appl
    34   118.5   14.9     532   4  US-09-389-956-80     Sequence 80, Appl
    35     117   14.7     457   4  US-09-389-956-68     Sequence 68, Appl
    36     117   14.7     647   4  US-09-389-956-6      Sequence 6, Appli
    37   114.5   14.4     543   4  US-09-362-123A-4     Sequence 4, Appli
    38   114.5   14.4     675   1  US-08-317-522A-9     Sequence 9, Appli
    39   114.5   14.4     675   1  US-08-439-818A-9     Sequence 9, Appli
    40   114.5   14.4     675   2  US-08-751-965-9      Sequence 9, Appli
    41   114.5   14.4     675   2  US-08-738-975-9      Sequence 9, Appli
    42   114.5   14.4     675   2  US-08-728-626-9      Sequence 9, Appli
    43   114.5   14.4     675   3  US-08-808-599A-9     Sequence 9, Appli
    44     114   14.3      21   3  US-08-907-468-11     Sequence 11, Appl
    45   113.5   14.2     462   3  US-08-486-099-117    Sequence 117, App


ALIGNMENTS 


RESULT 1 

US-09-087-232A-13 

; Sequence 13, Application US/09087232A
; Patent No. 6153431
; GENERAL INFORMATION:
; APPLICANT: Quillent et al.
; TITLE OF INVENTION: HUMAN IMMUNODEFICIENCY VIRUS CO-RECEPTOR
; TITLE OF INVENTION: VARIANTS ASSOCIATED WITH RESISTANCE TO VIRUS
; TITLE OF INVENTION: INFECTION
; NUMBER OF SEQUENCES: 23
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: Baker & Botts, L.L.P. attn. Lisa Kole
; STREET: 30 Rockefeller Plaza
; CITY: New York
; STATE: New York
; COUNTRY: USA
; ZIP: 10112
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Floppy disk
; COMPUTER: IBM PC compatible
; OPERATING SYSTEM: PC-DOS/MS-DOS
; SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/09/087,232A
; FILING DATE: 28 MAY 1998
; CLASSIFICATION: 435
; PRIOR APPLICATION DATA:
; APPLICATION NUMBER: 60/048,057
; FILING DATE: 30 MAY 1997
; ATTORNEY/AGENT INFORMATION:
; NAME: KOLE, LISA B.
; REGISTRATION NUMBER: 35,225
; REFERENCE/DOCKET NUMBER: AP 31115
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: (212) 408-2628
; TELEFAX: (212) 765-2519
; INFORMATION FOR SEQ ID NO: 13:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 352 amino acids
; TYPE: amino acid
; TOPOLOGY: linear
; MOLECULE TYPE: protein
US-09-087-232A-13 


Query Match 47.6%; Score 379; DB 3; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-34; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 


Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30
     ||||||||||||||||||||||||||||||
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64
      1 1 _ I I I I I I I I I I I I | : | : : |
Db 61 [illegible]

Qy 65 LCTRSQKEGLHYTC 78
Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVV[illegible] 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105
      I I M I I I I I I I I | | | | | | | | ,
Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135
       I I I I I I I I I I I I I I I I I I :|
Db 239 [illegible]


RESULT 2 

US-08-861-105-14 

; Sequence 14, Application US/08861105
; Patent No. 6258527
; GENERAL INFORMATION:
; APPLICANT: LITTMAN, DAN R.
; APPLICANT: DENG, HONGKUI
; APPLICANT: ELLMEIER, WILFRIED
; APPLICANT: LANDAU, NATHANIEL R.
; APPLICANT: LIU, RONG
; TITLE OF INVENTION: G-COUPLED RECEPTORS ASSOCIATED WITH
; TITLE OF INVENTION: MACROPHAGE-TROPHIC HIV, AND DIAGNOSTIC AND THERAPEUTIC
; TITLE OF INVENTION: USES THEREOF
; NUMBER OF SEQUENCES: 14
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: David A. Jackson, Esq.
; STREET: 411 Hackensack Ave, Continental Plaza, 4th
; STREET: Floor
; CITY: Hackensack
; STATE: New Jersey
; COUNTRY: USA
; ZIP: 07601
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Floppy disk
; COMPUTER: IBM PC compatible
; OPERATING SYSTEM: PC-DOS/MS-DOS
; SOFTWARE: PatentIn Release #1.0, Version #1.30
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/08/861,105
; FILING DATE:
; CLASSIFICATION: 436
; PRIOR APPLICATION DATA:
; APPLICATION NUMBER: US 08/666,020
; FILING DATE: 19-JUN-1996
; CLASSIFICATION: 436
; PRIOR APPLICATION DATA:
; APPLICATION NUMBER: US 08/227,319
; FILING DATE: 13-APR-1994
; ATTORNEY/AGENT INFORMATION:
; NAME: Jackson Esq., David A.
; REGISTRATION NUMBER: 26,742
; REFERENCE/DOCKET NUMBER: 1049-1-004 Nl
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: 201-487-5800
; TELEFAX: 201-343-1684
; INFORMATION FOR SEQ ID NO: 14:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 352 amino acids
; TYPE: amino acid
; STRANDEDNESS: single
; TOPOLOGY: linear
; MOLECULE TYPE: protein
; HYPOTHETICAL: NO
; ORIGINAL SOURCE:
; ORGANISM: Homo sapiens
US-08-861-105-14 

Query Match 47.6%; Score 379; DB 3; Length 352;
Best Local Similarity 35.0%; Pred. No. 1.7e-34;
Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7;

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30
     ||||||||||||||||||||||||||||||
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64
      1 1 I I I I I I I I I I I I I : | : : |
Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78
Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVV[illegible] 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105
      1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 .. |
Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135
       I I M I I I I I I I I I I I I I I : |
Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281


RESULT 3 

US-08-575-967A-2 

; Sequence 2, Application US/08575967A
; Patent No. 6265184
; GENERAL INFORMATION:
; APPLICANT: Gray et al.
; TITLE OF INVENTION: Chemokine Receptor Materials and Methods
; NUMBER OF SEQUENCES: 16
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: Marshall, O'Toole, Gerstein, Murray & Borun
; STREET: 6300 Sears Tower, 233 S. Wacker Drive
; CITY: Chicago
; STATE: Illinois
; COUNTRY: USA
; ZIP: 60606
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Floppy disk
; COMPUTER: IBM PC compatible
; OPERATING SYSTEM: PC-DOS/MS-DOS
; SOFTWARE: PatentIn Release #1.0, Version #1.30
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/08/575,967A
; FILING DATE:
; CLASSIFICATION: 435
; ATTORNEY/AGENT INFORMATION:
; NAME: Noland, Greta E.
; REGISTRATION NUMBER: 35,302
; REFERENCE/DOCKET NUMBER: 32918
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: 206-485-1900
; TELEFAX: 206-485-1662
; INFORMATION FOR SEQ ID NO: 2:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 352 amino acids
; TYPE: amino acid
; TOPOLOGY: linear
; MOLECULE TYPE: protein
; FEATURE:
; NAME/KEY: misc_feature
; OTHER INFORMATION: /= "88C amino acid sequence"

US-08-575-967A-2 

Query Match 47.6%; Score 379; DB 3; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-34; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30
     ||||||||||||||||||||||||||||||
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64
      1 I 1111111111111:1: : |
Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78
Db 119 IILLTIDRYLAVVHAVFALK[illegible] 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105
      I I I I I I II I I I II I I I I I I I :: |
Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135
       I I I I I I I I I I I I I I I I I I : I
Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281

RESULT 4 
US-08-833-752-5 

; Sequence 5, Application US/08833752
; Patent No. 6448375
; GENERAL INFORMATION:
; APPLICANT: SAMSON, MICHEL
; APPLICANT: PARMENTIER, MARC
; APPLICANT: VASSART, GILBERT
; APPLICANT: LIBERT, FREDERICK
; TITLE OF INVENTION: ACTIVE AND INACTIVE CC-CHEMOKINES RECEPTOR
; TITLE OF INVENTION: AND NUCLEIC ACID MOLECULES ENCODING SAID RECEPTOR
; NUMBER OF SEQUENCES: 17
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: Knobbe, Martens, Olson & Bear
; STREET: 620 Newport Center Drive 16th Floor
; CITY: Newport Beach
; STATE: CA
; COUNTRY: U.S.A.
; ZIP: 92660
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Floppy disk
; COMPUTER: IBM PC compatible
; OPERATING SYSTEM: PC-DOS/MS-DOS
; SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/08/833,752
; FILING DATE: 9-APR-1997
; CLASSIFICATION: 536
; ATTORNEY/AGENT INFORMATION:
; NAME: Altman, Daniel E
; REGISTRATION NUMBER: 34,115
; REFERENCE/DOCKET NUMBER:
; INFORMATION FOR SEQ ID NO: 5:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 352 amino acids
; TYPE: amino acid
; TOPOLOGY: linear
; MOLECULE TYPE: protein

US-08-833-752-5 


Query Match 47.6%; Score 379; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-34; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30
     ||||||||||||||||||||||||||||||
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64
      II I I I I I I I I I I I I I : | :: |
Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78
Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105
      I I I I I I I I I I I I I I I | | | | | . . |
Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135
       I I ! I I I I I I I I I I I I I I I : |
Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281


RESULT 5 

US-09-502-783A-2 

; Sequence 2, Application US/09502783A 

; Patent No. 6511826 

; GENERAL INFORMATION: 

; APPLICANT: Li, Yi 

; APPLICANT: Ruben, Steven M. 

; TITLE OF INVENTION: Polynucleotides Encoding Human G-Protein Chemokine Receptor (CCR5) 

; TITLE OF INVENTION: HDGNR10 

; FILE REFERENCE: 1488.1150006 

; CURRENT APPLICATION NUMBER: US/09/502,783A 

; CURRENT FILING DATE: 2001-08-23 

; PRIOR APPLICATION NUMBER: 08/466,343 

; PRIOR FILING DATE: 1995-06-06 

; NUMBER OF SEQ ID NOS: 9 

; SOFTWARE: PatentIn version 3.0 

; SEQ ID NO 2 

LENGTH: 352 
; TYPE: PRT 


; ORGANISM: Homo sapiens 
US-09-502-783A-2 


Query Match 47.6%; Score 379; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-34; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 
     |||||||||||||||||||||||||||||| 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

Db 119 IlLLTIDRYLAVVTiAVFALKART^ 178 


Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

: ' 'LLliL' 1 J_' I I MM I I :: | 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 


Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I M II I I I | | IIIIIM :| 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 6 
US-09-796-202-1 

; Sequence 1, Application US/09796202 

; Patent No. 6548636 

; GENERAL INFORMATION: 

; APPLICANT: Dragic, Tatjana 

; APPLICANT: Olson, William 

; TITLE OF INVENTION: SULFATED CCR5 PEPTIDES FOR HIV-1 INFECTION 

; FILE REFERENCE: 2048/61010/JPW/SHS 

; CURRENT APPLICATION NUMBER: US/09/796,202 

; CURRENT FILING DATE: 2001-02-28 

; NUMBER OF SEQ ID NOS: 17 

; SOFTWARE: PatentIn version 3.0 

; SEQ ID NO 1 

LENGTH: 352 

TYPE: PRT 
; ORGANISM: human 
US-09-796-202-1 


Query Match 47.6%; Score 379; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-34; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 
Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 
     |||||||||||||||||||||||||||||| 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

1 1 I M I I I I I I I || | : | : : | 


Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 
      : |||||||||||| 
Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

i I I ! I I I I I I I II I I i I I I I | 
Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : | 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 

RESULT 7 

US-09-045-583-52 

Sequence 52, Application US/09045583 
Patent No. 6287805 
GENERAL INFORMATION: 

APPLICANT: Graham, Gerard J. et al. 

TITLE OF INVENTION: Novel Molecules of the G Protein-Coupled 
NUMBER OF SEQUENCES: 56 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: LAHIVE & COCKFIELD, LLP 
STREET: 28 State Street 
CITY: Boston 
STATE: Massachusetts 
COUNTRY: USA 
ZIP: 02109 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/045,583 
FILING DATE: 20-MAR-98 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 
FILING DATE: 
ATTORNEY/AGENT INFORMATION: 
NAME: Mandragouras, Amy E. 
REGISTRATION NUMBER: 36,207 
REFERENCE/DOCKET NUMBER: MNI-044 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (617)227-7400 
TELEFAX: (617)742-4214 
INFORMATION FOR SEQ ID NO: 52: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 352 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FRAGMENT TYPE: internal 
US-09-045-583-52 


Query Match 46.9%; Score 374; DB 3; Length 352; 

Best Local Similarity 34.6%; Pred. No. 6e-34; 

Matches 98; Conservative 8; Mismatches 27; Indels 150; Gaps 7; 
Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 
     ||||||||||||:||||||||||||||||| 
Db 1 MDYQVSSPIYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

1 1 I I I I I I I I I I I I I : | : : | 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 
      : |||||||||||| 
Db 119 IILLTIDRYLAIVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 



Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG — 105 

I I I I I I I I I I I I I I I II II I .. , 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : | 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 8 

US-09-534-185-52 

; Sequence 52, Application US/09534185 
; Patent No. 6403767 

GENERAL INFORMATION: 
; APPLICANT: Graham, Gerard J. et al. 

; TITLE OF INVENTION: Novel Molecules of the G Protein-Coupled 
Heptahelical Receptor Superfamily and Uses 
Therefor 
NUMBER OF SEQUENCES: 56 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: LAHIVE & COCKFIELD, LLP 

STREET : 28 State Street 

CITY: Boston 

STATE : Massachusetts 

COUNTRY: USA 

ZIP: 02109 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/534,185 
FILING DATE: 24-Mar-2000 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 09/045,583 
FILING DATE: <Unknown> 
ATTORNEY/AGENT INFORMATION: 

NAME: Mandragouras, Amy E. 


REGISTRATION NUMBER: 36,207 
; REFERENCE/DOCKET NUMBER: MNI-044 

TELECOMMUNICATION INFORMATION: 
TELEPHONE: (617)227-7400 
TELEFAX: (617)742-4214 
INFORMATION FOR SEQ ID NO: 52: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 352 amino acids 

TYPE: amino acid 
; TOPOLOGY: linear 

; MOLECULE TYPE: peptide 

FRAGMENT TYPE: internal 
SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
US-09-534-185-52 


Query Match 46.9%; Score 374; DB 4; Length 352; 

Best Local Similarity 34.6%; Pred. No. 6e-34; 

Matches 98; Conservative 8; Mismatches 27; Indels 150; Gaps 7; 


Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 
     ||||||||||||:||||||||||||||||| 
Db 1 MDYQVSSPIYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 -YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 
1 I I I I I I I I I I I I I I : | : : I 
Db 61 

Qy 65 LCTRSQKEGLHYTC 78 
Db 119 IILLTIDRYLAIVKAVFALKARTVTFGVW 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 
I I M I I I I I I I | | M | | | | | .. | 
Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 -GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 
! I I I I I I I I I I I | M I I I : I 
Db 239 



RESULT 9 

US-08-466-343D-2 

Sequence 2, Application US/08466343D 
Patent No. 6025154 
GENERAL INFORMATION: 
APPLICANT: LI, Yi 

TITLE OF INVENTION: POLYNUCLEOTIDES ENCODING HUMAN G-PROTEIN 
TITLE OF INVENTION: CHEMOKINE RECEPTOR HDGNR10 (AS AMENDED) 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: STERNE, KESSLER, GOLDSTEIN & FOX P.L.L.C. 
STREET: 1100 NEW YORK AVE., NW, SUITE 600 
CITY: WASHINGTON 
STATE: DC 
COUNTRY: USA 
ZIP: 20005 
COMPUTER READABLE FORM: 


MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/466,343D 
FILING DATE: 06-JUN-1995 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 
NAME: STEFFE, ERIC K. 
REGISTRATION NUMBER: 36,688 

REFERENCE/DOCKET NUMBER: 1488.1150000/EKS/KLM 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: (202) 371-2600 

TELEFAX: (202) 371-2540 
INFORMATION FOR SEQ ID NO: 2: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 352 amino acids 
; TYPE: amino acid 

; TOPOLOGY: linear 

MOLECULE TYPE: protein 
US-08-466-343D-2 


Query Match 46.8%; Score 373; DB 3; Length 352; 

Best Local Similarity 34.6%; Pred. No. 7.8e-34; 

Matches 98; Conservative 7; Mismatches 28; Indels 150; Gaps 7; 
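Every result in this listing carries the same three-line summary of semicolon-separated "name value" pairs. A small sketch of pulling those fields into a dictionary is given below; the function name and the regular expression are illustrative assumptions, not part of GenCore, and the sample text is the summary of this result with the OCR blank lines removed.

```python
import re

SUMMARY = """\
Query Match 46.8%; Score 373; DB 3; Length 352;
Best Local Similarity 34.6%; Pred. No. 7.8e-34;
Matches 98; Conservative 7; Mismatches 28; Indels 150; Gaps 7;
"""

# A field is a label (letters, dots, spaces) followed by one number,
# optionally in scientific notation and optionally ending in '%'.
FIELD = re.compile(
    r"\s*([A-Za-z][A-Za-z. ]*?)\s+([0-9][0-9.]*(?:[eE][+-]?[0-9]+)?)%?\s*$"
)


def parse_summary(text):
    """Return {label: float} for each 'label value;' pair in the summary."""
    fields = {}
    for chunk in text.replace("\n", " ").split(";"):
        m = FIELD.match(chunk)
        if m:
            fields[m.group(1)] = float(m.group(2))
    return fields


fields = parse_summary(SUMMARY)
print(fields["Score"], fields["Pred. No."], fields["Gaps"])
```

Percent signs are dropped, so "Query Match" and "Best Local Similarity" come back as plain percentages.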

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I M I I I II I I I I I I I M II 
Db 1 MDYQVSSPIYDINYYTSEPCPKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCQR 60 

Qy 31 — YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I II I I I I : | : : | 

Db 61 LESMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

Db 119 IILLTIDRYLAIVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I | | | | | | | | . . | 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I M I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 10 
US-09-517-605-5 

; Sequence 5, Application US/09517605 

; Patent No. 6391567 

; GENERAL INFORMATION: 

; APPLICANT: Littman, Dan R. 

; APPLICANT: Kwon, Douglas S. 

; APPLICANT: van Kooyk, Yvette 

; APPLICANT: Geijtenbeek, Theo 


; TITLE OF INVENTION: METHODS OF USING A FACILITATOR OF RETROVIRAL ENTRY INTO 
; TITLE OF INVENTION: CELLS 

FILE REFERENCE: 1049-1-017 
; CURRENT APPLICATION NUMBER: US/09/517,605 
; CURRENT FILING DATE: 2000-03-02 
; NUMBER OF SEQ ID NOS: 17 
; SOFTWARE: PatentIn Ver. 2.0 
; SEQ ID NO 5 

LENGTH: 352 
; TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-517-605-5 


Query Match 45.0%; Score 359; DB 4; Length 352; 

Best Local Similarity 33.9%; Pred. No. 2.9e-32; 

Matches 96; Conservative 8; Mismatches 29; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I : I I I I I I I I I I I I I | I | 
Db 1 MDYQVSSPTYDIDYDTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILVLINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

1 I I I I I I I I I I I I I I : | :: | 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

Db 119 llLLTIDRYi^I VTiAVFALKARTVT 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I II I I I I :: | 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I M | : | 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 11 
US-08-724-984A-2 

; Sequence 2, Application US/08724984A 
; Patent No. 6388055 
; GENERAL INFORMATION: 

APPLICANT: Derk Bergsma, Mary Brawner, and Usman Shabon 

TITLE OF INVENTION: Novel Mouse Genomic Clone of the CC- 
TITLE OF INVENTION: CKR5 Receptor 

NUMBER OF SEQUENCES: 5 

CORRESPONDENCE ADDRESS: 

ADDRESSEE: SmithKline Beecham Corporation 
STREET: 709 Swedeland Road, P.O. Box 1539 
CITY: King of Prussia 
; STATE: PA 

COUNTRY: USA 
ZIP: 19406-0939 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: DISKETTE, 3.5 INCH, 1.44 Mb STORAGE 

COMPUTER: IBM 486 


; OPERATING SYSTEM: WINDOWS FOR WORKGROUPS 

; SOFTWARE : MICROSOFT WORD 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/724,984A 
FILING DATE: October 3, 1996 
CLASSIFICATION: 800 
PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: 

; FILING DATE: 

; ATTORNEY/AGENT INFORMATION: 
NAME: William T. Han 
REGISTRATION NUMBER: 34,344 
REFERENCE/DOCKET NUMBER: ATG50023 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 610 270 5024 

TELEFAX: 610 270 5090 
; INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 354 
TYPE: Amino Acid 
TOPOLOGY: Linear 
US-08-724-984A-2 


Query Match 34.1%; Score 272; DB 4; Length 354; 

Best Local Similarity 27.9%; Pred. No. 1.6e-22; 

Matches 80; Conservative 12; Mismatches 29; Indels 166; Gaps 8; 

Qy 1 MDYQVSSP--IYDINYYTSEPCQKINVKQIAA 30 

i I : I I I I I I | : | I I I I I | | | | | | | | 
Db 1 MDFQGSVPTYIYDIDYGMSAPCQKINVKQIAAQLLPPLYSLVFIFGFAGNMMVFLILISC 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG- 64 

11 I I : I I I I I I : I : I | | 

Db 61 KKLKSVTDIYLLNLAISDLLFLLTLPFWAHYAANEWIFGNIMC--KVFTGVYHIGYFGGI 118 

Qy 65 LCTRSQKEGLHY 76 

Db 119 FFIILLTIDRYLAI VTiAVFAXKVTTWFGV^ 178 


Qy 77 TCSSHFPYSQYQFWKNFQTLK IHQRVHGGGGSYKC 111 

I I I I I I I I I I I : I I I I I : | . . , 

Db 179 TCSPHFPHTQYHFWKSFQTLKMVILSLILPLLVMIICYSGILHT--LFRCRNEKK 231 

Qy 112 GLCQEFFGLNNCSSSNRLD 130 

I I I I I I II I I I I I I I I 

Db 232 RHRAVRLIFAIMIVYFLFWTPYNIVLLLTTFQEFFGLNNCSSSNRLD 278 


RESULT 12 
US-08-833-752-4 

; Sequence 4, Application US/08833752 

; Patent No. 6448375 

; GENERAL INFORMATION: 

APPLICANT: SAMSON, MICHEL 
APPLICANT: PARMENTIER, MARC 

; APPLICANT: VASSART, GILBERT 
APPLICANT: LIBERT, FREDERICK 


; TITLE OF INVENTION: ACTIVE AND INACTIVE CC-CHEMOKINES RECEPTOR 

TITLE OF INVENTION: AND NUCLEIC ACID MOLECULES ENCODING SAID RECEPTOR 
; NUMBER OF SEQUENCES: 17 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Knobbe, Martens, Olson & Bear 

STREET: 620 Newport Center Drive 16th Floor 

CITY: Newport Beach 
; STATE: CA 

COUNTRY: U.S.A. 

ZIP: 92660 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO) 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/833,752 

; FILING DATE: 9-APR-1997 

CLASSIFICATION: 536 
ATTORNEY/AGENT INFORMATION: 
NAME: Altman, Daniel E 
; REGISTRATION NUMBER: 34,115 

REFERENCE/DOCKET NUMBER: 
INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 184 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-833-752-4 


Query Match 32.4%; Score 258; DB 4; Length 184; 

Best Local Similarity 34.9%; Pred. No. 2.7e-21; 

Matches 65; Conservative 4; Mismatches 13; Indels 104; Gaps 4; 


Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 
     |||||||||||||||||||||||||||||| 
Db 1 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 
I I M I I I I I I I I I I I : | : : | 
Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 -LCTRSQKEGLHYTC 78 
: I I I I I I I I I I I I 
Db 119 

Qy 79 
Db 179 


RESULT 13 
US-09-087-232A-17 

; Sequence 17, Application US/09087232A 
; Patent No. 6153431 
; GENERAL INFORMATION: 


APPLICANT: Quillent et al. 

TITLE OF INVENTION: HUMAN IMMUNODEFICIENCY VIRUS CO-RECEPTOR 
TITLE OF INVENTION: VARIANTS ASSOCIATED WITH RESISTANCE TO VIRUS 
INFECTION. 

NUMBER OF SEQUENCES: 23 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Baker & Botts, L.L.P. attn. Lisa Kole 

; STREET: 30 Rockefeller Plaza 

CITY: New York 
STATE: New York 
; COUNTRY: USA 

; ZIP: 10112 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/09/087,232A 

; FILING DATE: 28 MAY 1998 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: 60/048,057 

FILING DATE: 30 MAY 1997 
ATTORNEY/AGENT INFORMATION: 
NAME: KOLE, LISA B. 
REGISTRATION NUMBER: 35,225 
; REFERENCE/DOCKET NUMBER: AP 31115 

TELECOMMUNICATION INFORMATION: 
TELEPHONE: (212) 408-2628 
; TELEFAX: (212) 765-2519 

; INFORMATION FOR SEQ ID NO: 17: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 215 amino acids 

TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-09-087-232A-17 


Query Match 32.4%; Score 258; DB 3; Length 215; 

Best Local Similarity 34.9%; Pred. No. 3.3e-21; 

Matches 65; Conservative 4; Mismatches 13; Indels 104; Gaps 4; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 
     |||||||||||||||||||||||||||||| 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I II I I I I I I : | : : | 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 
      : |||||||||||| 
Db 119 IILLTIDRYIAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 

Qy  79 SSHFPY 84 
       |||||| 
Db 179 SSHFPY 184 


RESULT 14 
US-08-833-752-6 

Sequence 6, Application US/08833752 
Patent No. 6448375 
GENERAL INFORMATION: 

APPLICANT: SAMSON, MICHEL 
APPLICANT: PARMENTIER, MARC 
APPLICANT: VASSART, GILBERT 
APPLICANT: LIBERT, FREDERICK 

TITLE OF INVENTION: ACTIVE AND INACTIVE CC-CHEMOKINES RECEPTOR 
TITLE OF INVENTION: AND NUCLEIC ACID MOLECULES ENCODING SAID RECEPTOR 
NUMBER OF SEQUENCES: 17 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Knobbe, Martens, Olson & Bear 
STREET: 620 Newport Center Drive 16th Floor 
CITY: Newport Beach 
STATE: CA 
COUNTRY: U.S.A. 
ZIP: 92660 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/833,752 
FILING DATE: 9-APR-1997 
CLASSIFICATION: 536 
ATTORNEY/AGENT INFORMATION: 
NAME: Altman, Daniel E 
REGISTRATION NUMBER: 34,115 
REFERENCE/DOCKET NUMBER: 
INFORMATION FOR SEQ ID NO: 6: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 215 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-833-752-6 

Query Match 32.4%; Score 258; DB 4; Length 215; 

Best Local Similarity 34.9%; Pred. No. 3.3e-21; 

Matches 65; Conservative 4; Mismatches 13; Indels 104; Gaps 4; 
Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 
     |||||||||||||||||||||||||||||| 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

1 I I I I II I II I I I I I : I : : | 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 
      : |||||||||||| 
Db 119 IILLTIDRYIAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 

Qy  79 SSHFPY 84 
       |||||| 
Db 179 SSHFPY 184 


RESULT 15 
US-09-087-232A-15 

; Sequence 15, Application US/09087232A 
; Patent No. 6153431 
; GENERAL INFORMATION: 

APPLICANT: Quillent et al. 
; TITLE OF INVENTION: HUMAN IMMUNODEFICIENCY VIRUS CO-RECEPTOR 

TITLE OF INVENTION: VARIANTS ASSOCIATED WITH RESISTANCE TO VIRUS 
INFECTION. 

NUMBER OF SEQUENCES: 23 
; CORRESPONDENCE ADDRESS: 

ADDRESSEE: Baker & Botts, L.L.P. attn. Lisa Kole 
; STREET: 30 Rockefeller Plaza 

; CITY: New York 

STATE: New York 
COUNTRY: USA 
; ZIP: 10112 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/087,232A 
FILING DATE: 28 MAY 1998 
CLASSIFICATION: 435 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 60/048,057 
FILING DATE: 30 MAY 1997 
ATTORNEY/AGENT INFORMATION: 
NAME: KOLE, LISA B. 
REGISTRATION NUMBER: 35,225 
REFERENCE/DOCKET NUMBER: AP 31115 
; TELECOMMUNICATION INFORMATION: 
TELEPHONE: (212) 408-2628 
TELEFAX: (212) 765-2519 
; INFORMATION FOR SEQ ID NO: 15: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 100 amino acids 
; TYPE: amino acid 

TOPOLOGY: linear 
; MOLECULE TYPE: protein 
US-09-087-232A-15 


Query Match 22.9%; Score 182.5; DB 3; Length 100; 

Best Local Similarity 43.0%; Pred. No. 3.8e-13; 

Matches 43; Conservative 0; Mismatches 4; Indels 53; Gaps 2; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 
     |||||||||||||||||||||||||||||| 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTM 47 

11 I I I I I I II I II 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTM 100 


Search completed: September 17, 2004, 16:49:03 
Job time : 23.5186 secs 


GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 


OM protein - protein search, using sw model 
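The "sw model" named here is Smith-Waterman local alignment, and the header further down reports a gap-opening penalty of 10.0 and a gap-extension penalty of 0.5. The sketch below scores a local alignment with affine gaps in that spirit. It is not GenCore's implementation: identity scoring (match 5, mismatch -4) stands in for the BLOSUM62 table, and the convention that the first gapped residue costs `gap_open` and each further one costs `gap_ext` is an assumption.

```python
def sw_affine(a, b, match=5.0, mismatch=-4.0, gap_open=10.0, gap_ext=0.5):
    """Best local alignment score of a and b with affine gap penalties.

    H: best score of an alignment ending at (i, j)
    E: best score ending with a gap in `a` (extra residue consumed from b)
    F: best score ending with a gap in `b` (extra residue consumed from a)
    """
    neg_inf = float("-inf")
    n, m = len(a), len(b)
    H = [[0.0] * (m + 1) for _ in range(n + 1)]
    E = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    F = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    best = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Open a new gap from H, or extend the gap already open.
            E[i][j] = max(H[i][j - 1] - gap_open, E[i][j - 1] - gap_ext)
            F[i][j] = max(H[i - 1][j] - gap_open, F[i - 1][j] - gap_ext)
            s = match if a[i - 1] == b[j - 1] else mismatch
            # The 0.0 floor is what makes the alignment local.
            H[i][j] = max(0.0, H[i - 1][j - 1] + s, E[i][j], F[i][j])
            best = max(best, H[i][j])
    return best


# Two 5-residue matching blocks separated by a 2-residue insertion:
# 25 - (10.0 + 0.5) + 25
print(sw_affine("ACDEFGHIKL", "ACDEFWWGHIKL"))
```

Swapping the identity scores for a real substitution matrix only changes how `s` is looked up; the recurrence stays the same.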


Run on: September 17, 2004, 16:42:06 ; Search time 22.4542 Seconds 
(without alignments) 
591.177 Million cell updates/sec 

Title: US-10-057-890A-10 
Perfect score: 797 
Sequence : 1 MDYQVSSPIYDINYYTSEPC GLNNCSSSNRLDGHQRVHAA 138 

Scoring table: BLOSUM62 
Gapop 10.0 , Gapext 0.5 

Searched: 283366 seqs, 96191526 residues 

Total number of hits satisfying chosen parameters: 283366 

Minimum DB seq length: 0 
Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

Database : PIR_78:* 
pir1:* 
pir2:* 
pir3:* 
pir4:* 


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
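The throughput figure in this header can be checked from the other header values: a Smith-Waterman search fills one dynamic-programming cell per (query residue, database residue) pair, so the cell count is the query length (138) times the database residues searched (96191526). The sketch below assumes that definition of "cell update"; the function name is hypothetical.

```python
def million_cell_updates_per_sec(query_len, db_residues, seconds):
    """Dynamic-programming cells filled per second, in millions."""
    return query_len * db_residues / seconds / 1e6


rate = million_cell_updates_per_sec(
    query_len=138,           # query length from the "Sequence" line
    db_residues=96_191_526,  # from the "Searched" line
    seconds=22.4542,         # from the "Search time" field
)
print(f"{rate:.2f} million cell updates/sec")
```

The result agrees with the reported 591.177 million to within rounding of the printed search time.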
ALIGNMENTS 


RESULT 1 
A43113 

chemokine (C-C) receptor 5 - human 
N;Alternate names: C-C CKR-5; CCR5 
C;Species: Homo sapiens (man) 

C;Date: 12-Jul-1996 #sequence_revision 12-Jul-1996 #text_change 20-Jun-2000 
C;Accession: A43113; S71808; A58834; A58832; G02653; A58833 
R;Samson, M.; Labbe, O.; Mollereau, C.; Vassart, G.; Parmentier, M. 
Biochemistry 35, 3362-3367, 1996 

A;Title: Molecular cloning and functional expression of a new human CC-chemokine 
receptor gene. 

A;Reference number: A43113; MUID:96241590; PMID:8639485 
A;Accession: A43113 
A;Molecule type: mRNA 
A;Residues: 1-352 <SAM1> 

A;Cross-references: GB:X91492; NID:g1262810; PIDN:CAA62796.1; PID:g1262811 
R;Samson, M.; Libert, F.; Doranz, B.J.; Rucker, J.; Liesnard, C.; Farber, C.M.; 
Saragosti, S.; Lapoumeroulie, C.; Cognaux, J.; Forceille, C.; Muyldermans, G.; 
Verhofstede, C.; Burtonboy, G.; Georges, M.; Imai, T.; Rana, S.; Yi, Y.; Smyth, 
R.J.; Collman, R.G.; Doms, R.W.; Vassart, G.; Parmentier, M. 


Nature 382, 722-725, 1996 

A;Title: Resistance to HIV-1 infection in Caucasian individuals bearing mutant 
alleles of the CCR-5 chemokine receptor gene. 
A;Reference number: S71808; MUID:96345670; PMID:8751444 
A;Accession: S71808 
A;Status: nucleic acid sequence not shown; not compared with conceptual 
translation 
A;Molecule type: DNA 
A;Residues: 182-206;207-230 <SAM2> 
A;Accession: A58834 
A;Status: nucleic acid sequence not shown; not compared with conceptual 
translation 
A;Molecule type: DNA 
A;Residues: 1-184,'IKDSHLGAGPAAACHGHLLLGNPKNSASVSK' <SAM3> 
A;Cross-references: GB:X99393; NID:g1524062; PIDN:CAA67767.1; PID:g1524063 
A;Note: this frameshift mutation results in a non-functional receptor but 
confers a degree of resistance to HIV-1 infection; it has an allele frequency of 
0.09 or more in some Caucasian populations and may have had a selective 
advantage by conferring resistance to Yersinia plague infections 
R;Combadiere, C.; Ahuja, S.K.; Tiffany, H.L.; Murphy, P.M. 
J. Leukoc. Biol. 60, 147-152, 1996 
A;Title: Cloning and functional expression of CC CKR5, a human monocyte CC 
chemokine receptor selective for MIP-1alpha, MIP-1beta, and RANTES. 
A;Reference number: A58832; MUID:96295970; PMID:8699119 
A;Accession: A58832 
A;Molecule type: mRNA 
A;Residues: 1-352 <COM1> 
A;Cross-references: GB:U57840; NID:g1502408; PIDN:AAB17071.1; PID:g1502409 
A;Experimental source: clone 8, endotoxin-stimulated peripheral blood monocytes 
R;Combadiere, C. 
submitted to the EMBL Data Library, May 1996 
A;Reference number: H01541 
A;Accession: G02653 
A;Status: translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A;Residues: 1-89,'L',91-352 <COM2> 
A;Cross-references: EMBL:U57840 

R;Raport, C.J.; Gosling, J.; Schweickart, V.L.; Gray, P.W.; Charo, I.F. 
J. Biol. Chem. 271, 17161-17166, 1996 

A;Title: Molecular cloning and functional characterization of a novel human CC 
chemokine receptor (CCR5) for RANTES, MIP-1beta, and MIP-1alpha. 
A;Reference number: A58833; MUID:96291862; PMID:8663314 
A;Accession: A58833 
A;Molecule type: mRNA 
A;Residues: 1-352 <RAP> 
A;Cross-references: GB:U54994; NID:g1457945; PIDN:AAC50598.1; PID:g1457946 
C;Comment: This is a receptor for chemokines MIP-1alpha (see PIR:A30574), MIP-1beta 
(see PIR:A31767), and RANTES (see PIR:A28815). 

C;Comment: Macrophage- and dual-tropic strains of HIV-1 bind to a complex of 
chemokine (C-C) receptor 5 and T-cell surface glycoprotein CD4 (see PIR:RWHUT4). 
C;Genetics: 
A;Gene: GDB:CMKBR5; CCR5; CKR-5; CC-CKR-5; CKR5; ChemR13 
A;Cross-references: GDB:1230510; OMIM:601373 
A;Map position: 3p21-3p21 
C;Function: 
A;Description: G protein-coupled receptor for chemokines MIP-1alpha, MIP-1beta 
and RANTES 
A;Note: probably acts to control granulocyte proliferation and differentiation 
C;Superfamily: vertebrate rhodopsin 
C;Keywords: AIDS; G protein-coupled receptor; glycoprotein; phosphoprotein; 
transmembrane protein 
F;32-56/Domain: transmembrane #status predicted <TM1> 
F;67-87/Domain: transmembrane #status predicted <TM2> 
F;103-124/Domain: transmembrane #status predicted <TM3> 
F;142-166/Domain: transmembrane #status predicted <TM4> 
F;193-218/Domain: transmembrane #status predicted <TM5> 
F;236-257/Domain: transmembrane #status predicted <TM6> 
F;285-300/Domain: transmembrane #status predicted <TM7> 
F;20-269,101-178/Disulfide bonds: #status predicted 
F;268/Binding site: carbohydrate (Asn) (covalent) #status predicted 
F;336,337,342/Binding site: phosphate (Ser) (covalent) #status predicted 
F;340,343/Binding site: phosphate (Thr) (covalent) #status predicted 
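The F; lines of this entry give residue ranges for the seven predicted transmembrane domains. A short sketch of extracting those ranges and their lengths follows; the pattern tolerates the stray spaces seen in OCR'd copies, and the function name is hypothetical.

```python
import re

FEATURES = """\
F;32-56/Domain: transmembrane #status predicted <TM1>
F;67-87/Domain: transmembrane #status predicted <TM2>
F;103-124/Domain: transmembrane #status predicted <TM3>
F;142-166/Domain: transmembrane #status predicted <TM4>
F;193-218/Domain: transmembrane #status predicted <TM5>
F;236-257/Domain: transmembrane #status predicted <TM6>
F;285-300/Domain: transmembrane #status predicted <TM7>
F;20-269,101-178/Disulfide bonds: #status predicted
"""

# start-end / Domain: transmembrane ... <label>
TM_LINE = re.compile(
    r"F;\s*(\d+)\s*-\s*(\d+)\s*/\s*Domain\s*:\s*transmembrane.*<(\w+)>"
)


def transmembrane_segments(text):
    """Return (label, start, end, length) for each transmembrane feature."""
    segments = []
    for line in text.splitlines():
        m = TM_LINE.match(line.strip())
        if m:
            start, end = int(m.group(1)), int(m.group(2))
            segments.append((m.group(3), start, end, end - start + 1))
    return segments


for label, start, end, length in transmembrane_segments(FEATURES):
    print(label, start, end, length)
```

Lines that are not transmembrane domains, such as the disulfide-bond feature, are skipped.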

Query Match 47.6%; Score 379; DB 2; Length 352; 

Best Local Similarity 35.0%; Pred. No. 2.5e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 
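The "Query Match" percentage appears to be the hit's score divided by the perfect score of 797 given in the header of this search, that is, the score the query would get against itself. This relation is inferred from the printed numbers and is not a documented GenCore formula. A minimal check:

```python
PERFECT_SCORE = 797  # "Perfect score" from the search header


def query_match_percent(score, perfect=PERFECT_SCORE):
    """Score expressed as a percentage of the query's self-score."""
    return 100.0 * score / perfect


# Scores of the first four results of this search.
for score in (379, 154, 144, 143.5):
    print(score, f"{query_match_percent(score):.1f}%")
```

The values round to the 47.6%, 19.3%, 18.1% and 18.0% shown for results 1 to 4.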

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 
     |||||||||||||||||||||||||||||| 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I II I I I I I I : I : : | 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 
      : |||||||||||| 
Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I II I I I I I : : | 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I II I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 2 
I57505 

zinc finger protein - mouse 

C;Species: Mus musculus (house mouse) 

C;Date: 02-Aug-1996 #sequence_revision 02-Aug-1996 #text_change 05-Nov-1999 
C;Accession: I57505 

R;Ernoult-Lange, M.; Kress, M.; Hamer, D. 
Mol. Cell. Biol. 10, 418-421, 1990 

A;Title: A gene that encodes a protein consisting solely of zinc finger domains 
is preferentially expressed in transformed mouse cells. 
A;Reference number: I57505; MUID:90097859; PMID:2104662 
A;Accession: I57505 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A;Residues: 1-201 <RES> 

A;Cross-references: GB:M32057; NID:g199766; PIDN:AAA39729.1; PID:g199767 

C;Genetics: 
A;Gene: MOK2 


Query Match 19.3%; Score 154; DB 2; Length 201; 

Best Local Similarity 37.0%; Pred. No. 9.6e-08; 

Matches 44; Conservative 13; Mismatches 44; Indels 18; Gaps 6; 

Qy 26 KQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYKCGLCTR--SQKEGLHYTCSSHF- 82 

I I : I I : s : : : I I I I I I I I I I : | | II : | 

Db 60 KPYACEECGMSFSQR SNLHIHQRVH-TGERPYKCGEGGKGFSQSSNLHIHRCTHTG 114 

Qy 83 --PYSQYQFWKNF QTLKIHQRVHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVH 136 

M I : M hllllll I I I I : I I I :: I I I | | I 

Db 115 EKPYQCYECGKGFSQSSDLRIHLRVHTGEKPYHCGKCGQGF SQSSKLLIHQRVH 168 


RESULT 3 
I37570 

zinc finger protein - human (fragment) 
C;Species: Homo sapiens (man) 

C;Date: 29-May-1998 #sequence_revision 29-May-1998 #text_change 01-Dec-2000 
C;Accession: I37570 

R;Abrink, M.; Aveskogh, M.; Hellman, L. 
DNA Cell Biol. 14, 125-136, 1995 

A;Title: Isolation of cDNA clones for 42 different Kruppel-related zinc finger 
proteins expressed in the human monoblast cell line U-937. 
A;Reference number: I37566; MUID:95169271; PMID:7865130 
A;Accession: I37570 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A;Residues: 1-693 <RES> 

A;Cross-references: EMBL:X78927; NID:g498726; PIDN:CAA55527.1; PID:g498727 
C;Genetics: 
A;Gene: HZF4 

C;Superfamily: zinc finger protein ZFP-36; LIM metal-binding repeat homology 

Query Match 18.1%; Score 144; DB 2; Length 693; 

Best Local Similarity 33.3%; Pred. No. 3e-06; 

Matches 39; Conservative 15; Mismatches 33; Indels 30; Gaps 7; 

Qy 21 QKINVKQIAAYKCGLCAAA-QWDFGNTMCQHQRVHGHHHHSYKCGLCTRSQKEGLHYTCS 79 

M:: : Mill : I I I I I I I I I I I II::: 
Db 554 QKVHTGE-KPYKCGECGKGFKWSLNLDM--HQRVH-TGEKPYTCGAC GKHFSQA 603 

Qy 80 SHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVH 136 

I : | : : | | | | | III : I : I I I : : I I : I I I 

Db 604 S SLQLHQSVHTGEKPYKCDVCGKVF SRSSQLQYHRRVH 641 


RESULT 4 
T46469 

hypothetical protein DKFZp434G1930.1 - human (fragment) 
C;Species: Homo sapiens (man) 

C;Date: 04-Feb-2000 #sequence_revision 04-Feb-2000 #text_change 04-Feb-2000 
C;Accession: T46469 

R;Blum, H.; Bauersachs, S.; Mewes, H.W.; Gassenhuber, J.; Wiemann, S. 
submitted to the Protein Sequence Database, January 2000 

A;Reference number: Z23034 
A;Accession: T46469 
A;Status: preliminary 
A;Molecule type: mRNA 
A;Residues: 1-319 <AAA> 
A;Cross-references: EMBL:AL137483 

A;Experimental source: adult testis; clone DKFZp434G1930 
C;Genetics: 

A;Note: DKFZp434G1930.1 

Query Match 18.0%; Score 143.5; DB 2; Length 319; 

Best Local Similarity 32.8%; Pred. No. 1.6e-06; 

Matches 38; Conservative 15; Mismatches 44; Indels 19; Gaps 7; 

Qy 31 YKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYKCGLCTRSQKEGLHYTCSSHF P 83 

Ml I : ::: 111:1 III I :: I I | 

Db 9 YKCNECEKT-FSHRSSLLSHQRIH-TGEKPYKCNECEKAFSNS--STLIKHLRVHTGEKP 64 

Qy 84 YSQYQFWKNF QTLKIHQRVHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVH 136 

I : M I I : I I I : I I I I I I I :: I I I : : I I I I : I 
Db 65 YRCRECGKAFSQCSTLTVHQRIHTGEKLYKCGECEKAF NCRA--KLHRHQRIH 115 


RESULT 5 
I53869 

zinc finger protein - mouse 

C;Species: Mus musculus (house mouse) 

C;Date: 29-May-1998 #sequence_revision 29-May-1998 #text_change 01-Dec-2000 

C;Accession: I53869 

R;Brady, J.P.; Piatigorsky, J. 

Gene 149, 299-304, 1994 

A;Title: A mouse cDNA encoding a protein with zinc-fingers and a KRAB domain 
shows similarity to human profilaggrin. 

A;Reference number: I53869; MUID:95047492; PMID:7959006 
A;Accession: I53869 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A;Residues: 1-555 <RES> 

A;Cross-references: GB:L28167; NID:g758660; PIDN:AAA67545.1; PID:g758661 
C;Superfamily: zinc finger protein ZFP-36; LIM metal-binding repeat homology 

Query Match 17.7%; Score 141; DB 2; Length 555; 

Best Local Similarity 23.8%; Pred. No. 4.7e-06; 

Matches 50; Conservative 25; Mismatches 47; Indels 88; Gaps 11; 

Qy 4 QVSSPIYDINYYTSEPCQ KINVKQIAAYKCGLCAAAQWDFGNTMCQH 50 

: I : I : I I I I : I : : :: | | | : | : : : | 

Db 303 RVCTP VKPYTCEQCEKSLLVDQHLMSHVKVHTRE-RPYNCETCGSA-FSQASHLQDH 357 

Qy 51 QRVH GHHHHS YKCGLCTRSQKEGLHYTCSSHF- 82 

I I = I II I I II I : I : I I I : 

Db 358 QRLHTGEKPFKCDACGKSFSRSSHLRSHQRVHTGEKPYKCGECGKS FICSSNLY 411 

Qy 83 PYSQYQFWKNF QTLKIHQRVHGGGGSYKCGLCQEFFGLNN 122 

M II : | : | I : | | II I : | : : I | : 

Db 412 IHQRVHTGEKPYKCVDCGKEFSRPSSLQAHQGIHTGEKSYVCTMCGKGYTLNSNLQVHLR 471 


Qy 123 C SSSNRLDGHQRVH 136 

I I I : : I I 1 I I I 

Db 472 VHTGEKPYSCDVCGKGFSRS SQLQSHQRVH 501 


RESULT 6 
F31201 

GLI-related finger protein HKR4 - human (fragments) 
C;Species: Homo sapiens (man) 

C;Date: 31-Mar-1990 #sequence_revision 31-Mar-1990 #text_change 17-Mar-1994 
C;Accession: F31201 

R;Ruppert, J.M.; Kinzler, K.W.; Wong, A.J.; Bigner, S.H.; Kao, F.T.; Law, M.L.; 
Seuanez, H.N.; O'Brien, S.J.; Vogelstein, B. 
Mol. Cell. Biol. 8, 3104-3113, 1988 

A;Title: The GLI-Kruppel family of human genes. 
A;Reference number: A93103; MUID:89096896; PMID:2850480 
A;Accession: F31201 
A;Molecule type: DNA 
A;Residues: 1-223 <RUP> 

C;Keywords: DNA binding; zinc finger 

Query Match 17.4%; Score 138.5; DB 2; Length 223; 

Best Local Similarity 30.7%; Pred. No. 3.4e-06; 

Matches 42; Conservative 13; Mismatches 39; Indels 43; Gaps 7; 

Qy 31 YKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYKCGLCTRSQKEGLHYTCSSHFPYSQYQFW 90 

hill I : : I I I : I : I I II I I I : II : I I : I 
Db 96 YECGQCGRA-FSHSSHFTQHLRIH-NGEKPYKCGECVRHQR-- LH TGEKPYACSQCG 148

Qy 91 KNF QTLKIHQRVHGGGGSYK CGLCQEFFG 119 

II lllhllh II I : I I 

Db 149 KAFIWSSVLIEHQRIHTGEKPYECSDCGKAFRGRSHFFRHLRTHTGEKPFACGACGKAFG 208 

Qy 120 LNNCSSSNRLDGHQRVH 136

I : : I I I I I I
Db 209 QSSQLIQHQRVH 220


RESULT 7 
138599 

zinc finger protein ZNF134 - human 
C; Species: Homo sapiens (man) 

C;Date: 23-Feb-1996 #sequence_revision 23-Feb-1996 #text_change 05-Nov-1999 
C; Accession: 138599 
R;Tommerup, N. ; Vissing, H. 
Genomics 27, 259-264, 1995 

A; Title: Isolation and fine mapping of 16 novel human zinc finger-encoding cDNAs 

identify putative candidate genes for developmental and malignant disorders. 

A;Reference number: A57785; MUID: 96044430; PMID:7557990 

A;Accession: 138599

A; Status : preliminary 

A;Molecule type: mRNA 

A; Residues: 1-348 <RES> 

A;Cross-references: EMBL:U09412; NID:g488552; PIDN:AAC50253.1; PID:g488553

C;Genetics:

A;Gene: GDB:ZNF134

A;Cross-references: GDB:137033

A;Map position: 19q13.4-19q13.4


Query Match 17.1%; Score 136.5; DB 2; Length 348; 

Best Local Similarity 27.3%; Pred. No. 8.2e-06; 

Matches 45; Conservative 13; Mismatches 46; Indels 61; Gaps 9; 

Qy 31 YKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYKCGLC TRSQKEGLH YT 77 

lllll: : I : | M I : I I : I I I I : : I | 

Db 97 YKCSECGKA-FSRKDTLVQHQRIHS-GEKPYECSECGKAFSRKATLVQHQRIHTGERPYE 154 

Qy 78 CS SHFPYSQYQFWKNF QTLKIHQRVHGGGGSYKCGLC 114 

M II ■ I I I : I I I I I I I I I I 

Db 155 CSECGKTFSRKDNLTQHKRIHTGEMPYKCNECGKYFSHHSNLIVHQRVHNGARPYKCSDC 214 

Qy 115 QEFF GLN--NCSSSNRLDG HQRVH 136 

: I I I : I I : I I I I : I 

Db 215 GKVFRHKSTLVQHESIHTGENPYDCSDCGKSFGHKYTLIKHQRIH 259 


RESULT 8 
139311 

Kruppel-type zinc finger protein ZNF74 - human 
C; Species: Homo sapiens (man) 

C;Date: 06-Sep-1996 #sequence_revision 06-Sep-1996 #text_change 01-Dec-2000
C;Accession: 139311; F42825; S37482 

R;Aubry, M.; Demczuk, S.; Desmaze, C.; Aikem, M.; Aurias, A.; Julien, J.P.;
Rouleau, G.A. 

Hum. Mol. Genet. 2, 1583-1587, 1993 

A;Title: Isolation of a zinc finger gene consistently deleted in DiGeorge
syndrome.

A;Reference number: 139311; MUID:94093543; PMID:8268910
A;Accession: 139311

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-572 <RES> 

A;Cross-references: EMBL:X71623; NID:g1050529; PIDN:CAA50632.1; PID:g1050530
A;Note: submitted to the EMBL Data Library, April 1993, revised 01-NOV-1995 
R;Aubry, M.; Marineau, C.; Zhang, F.R.; Zahed, L.; Figlewicz, D.; Delattre, O.;
Thomas, G.; de Jong, P.J.; Julien, J.P.; Rouleau, G.A.
Genomics 13, 641-648, 1992 

A; Title: Cloning of six new genes with zinc finger motifs mapping to short and 
long arms of human acrocentric chromosome 22 (p and q11.2).
A;Reference number: A42825; MUID: 92347859; PMID:1639391 
A; Accession: F42825 

A; Status: preliminary; not compared with conceptual translation 

A;Molecule type: nucleic acid 

A;Residues: 427-444,'V',446-511 <AUB>

A;Note: sequence extracted from NCBI backbone (NCBIP:109775)

C;Superfamily: zinc finger protein ZFP-36; LIM metal-binding repeat homology

Query Match 17.0%; Score 135.5; DB 2; Length 572; 

Best Local Similarity 29.2%; Pred. No. 1.7e-05; 

Matches 33; Conservative 14; Mismatches 25; Indels 41; Gaps 5; 

Qy 31 YKCGLCAAAQWDFGNTMCQHQRVHGHHHH SYKCGLCTRSQKEGLHYTCSSHFP 83 

hllll : I I I II hill I : I I I 
Db 288 YRCGECGKA FNQRTHLTRHHRIHTGEKPYQCGSC GKAFTCHS 329 


Qy 84 YSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVH 136 

: I : I :: : I | : I I I :: I : I : I I I I I I 

Db 330 SLTVHEKIHSGDKPFKCSDCEKAF NSRSRLTLHQRTH 366 


RESULT 9 
148668 

zinc finger protein 51 - mouse 
N;Alternate names: finger protein zfecl2 
C; Species: Mus musculus (house mouse) 

C;Date: 02-Jul-1996 #sequence_revision 02-Jul-1996 #text_change 08-Dec-2000 

C;Accession: 148668; A40984 

R; Burke, P.S.; Don, J.; Wolgemuth, D.J. 

Mamm. Genome 5, 387-389, 1994 

A; Title: Zfp-51, a murine zinc finger encoding gene mapping to the t-complex 
region of chromosome 17, encodes 19 contiguous zinc fingers and is ubiquitously 
expressed. 

A;Reference number: 148668; MUID:94319090; PMID:8043957
A;Accession: 148668 

A; Status : preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-710 <RES> 

A;Cross-references: EMBL:X74855; NID:g488832; PIDN:CAA52847.1; PID:g488833
R;Crossley, P.H.; Little, P.F.R. 

Proc. Natl. Acad. Sci. U.S.A. 88, 7923-7927, 1991 

A;Title: A cluster of related zinc finger protein genes is deleted in the mouse
embryonic lethal mutation t(w18).

A;Reference number: A40984; MUID:91376058; PMID:1680234

A;Accession: A40984

A; Status : preliminary 

A; Molecule type: mRNA 

A; Residues: 672-710 <CRO> 

A; Cross-references : GB:M74235 

C; Genetics : 

A; Gene: Zfp-51 

C;Superfamily: zinc finger protein ZFP-36; LIM metal-binding repeat homology
C; Keywords: DNA binding; zinc finger 

Query Match 16.8%; Score 134; DB 2; Length 710; 

Best Local Similarity 28.6%; Pred. No. 2.9e-05; 

Matches 38; Conservative 20; Mismatches 39; Indels 36; Gaps 7;

Qy 9 IYDINYYTS EPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYKC 63 

: I I : I : : I : I I I II I I : : : : : I I : I III 
Db 120 VYD-DYFSSAYSLMQPTAYIKEKPHQCGKCGKC FNTSSSLTVHHRIH-TKEKPYKC 173 

Qy 64 GLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGLNNC 123

: I : I : I : I I II I I I I I I I I I :: I 
Db 174 SVCDKS FTQCTH LKIHQRRHTGEKPYKCKECEKSF 208 

Qy 124 SSSNRLDGHQRVH 136

: I I I : : I 
Db 209 VQLSALKSHQKLH 221 


RESULT 10 


S03679 

finger protein (clone mkr5) - mouse (fragment) 
C;Species: Mus musculus (house mouse)

C;Date: 04-Dec-1992 #sequence_revision 04-Dec-1992 #text_change 01-Dec-2000
C;Accession: S03679 

R;Chowdhury, K. ; Rohdewohld, H.; Gruss, P. 
Nucleic Acids Res. 16, 9995-10011, 1988 

A;Title: Specific and ubiquitous expression of different Zn finger protein genes 
in the mouse. 

A;Reference number: S03677; MUID:89057528; PMID:3143103
A;Accession: S03679

A; Status: not compared with conceptual translation 
A; Molecule type: mRNA 
A; Residues: 1-475 <CHO> 

A;Note: the sequence in fig. 2 is inconsistent with that shown in fig. 1 in having
64-Asn, 159-Trp, 187-trp, 188-trp, and in lacking a Met after 64-Gly and an Asp 
after 78-Ser 

C;Superfamily: zinc finger protein ZFP-36; LIM metal-binding repeat homology

Query Match 16.8%; Score 133.5; DB 2; Length 475; 

Best Local Similarity 27.8%; Pred. No. 2.2e-05; 

Matches 40; Conservative 24; Mismatches 49; Indels 31; Gaps 8; 

Qy 15 YTSEPCQK INVKQ IAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYK 62 

I I I I I : I I | : | : | | : :: I I I I I I : I 

Db 223 YTCEVCHKSFRYGSSLTVHQRIHTGEKPYECEICRKA-FSHHASLTQHQRVHS-GEKPFK 280

Qy 63 CGLCTRSQKEGLHYTCSSHF PYSQYQFWKNF QTLKIHQRVHGGGGSYKCG 112 

I I : : : : : I : | | : I : : | : | | I I I : I I :: | 

Db 281 CKECGKAFRQNIH — LASHWRIHTGEKPFECGECGKSFSISSQLATHQRIHTGEKPFECK 338 

Qy 113 LCQEFFGLNNCSSSNRLDGHQRVH 136 

: I : : I I I I hi 

Db 339 VCRKAFRQN IHLASHWRIH 357 


RESULT 11 
148689 

gene NK10 protein - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 02-Jul-1996 #sequence_revision 02-Jul-1996 #text_change 01-Dec-2000 
C;Accession: 148689; S49078 

R;Lange, R. ; Christoph, A.; Thiesen, H.J.; Vopper, G.; Johnson, K.R.; Lemaire, 
L.; Plomann, M. ; Cremer, H.; Barthels, D. ; Heinleim, U.A.O. 
DNA Cell Biol. 14, 971-981, 1995 

A;Title: Developmentally regulated mouse gene NK10 encodes a Zinc Finger
Repressor Protein with differential DNA-Binding Domains.
A;Reference number: 148689; MUID:96069544; PMID:7576184
A;Accession: 148689

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-636 <RES> 

A;Cross-references: EMBL:X79828; NID:g506501; PIDN:CAA56225.1; PID:g506502
C;Superfamily: zinc finger protein ZFP-36; LIM metal-binding repeat homology


Query Match 16.8%; Score 133.5; DB 2; Length 636;

Best Local Similarity 31.6%; Pred. No. 2.9e-05;

Matches 37; Conservative 15; Mismatches 44; Indels 21; Gaps 6;


Qy 31 YKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYKCGLCTRS QKEGLHYTCSSHF 82 

1:1111: : : : I I I I : | I : I I I I I I I I I 
Db 306 YQCSLCGKA-FQRSSSLVQHQRIH-TGEKPYRCNLCGRSFRHSTSLTQHEVTH SGEK 360 

Qy 83 PYSQYQFWKNF QTLKIHQRVHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVH 136 

I : : I I : I hill : : I : I I I I I I hi 

Db 361 PFQCKECGKAFSRCSSLVQHERTHTGEKPFECSICGRAFG QSPSLYKHMRIH 412 


RESULT 12 
T14757 

hypothetical protein DKFZp572C163.1 - human (fragment)
C; Species: Homo sapiens (man) 

C;Date: 20-Sep-1999 #sequence_revision 20-Sep-1999 #text_change 01-Dec-2000
C; Accession: T14757 

R;Wambutt, R. ; Heubner, D.; Mewes, H.W. ; Gassenhuber, J.; Wiemann, S. 

submitted to the Protein Sequence Database, August 1999

A; Reference number: Z18181 

A;Accession: T14757 

A; Status : preliminary 

A;Molecule type: mRNA 

A; Residues: 1-701 <WAM> 

A;Cross-references: EMBL:AL110217

A; Experimental source: adult subthalamic nucleus; clone DKFZp572C163 
C; Genetics : 

A;Note: DKFZp572C163.1

C; Superfamily: zinc finger protein ZFP-36; LIM metal-binding repeat homology 

Query Match 16.8%; Score 133.5; DB 2; Length 701; 

Best Local Similarity 23.4%; Pred. No. 3.2e-05; 

Matches 41; Conservative 16; Mismatches 37; Indels 81; Gaps 8; 

Qy 31 YKCGLCAAAQWDFGNT MCQHQRVH GHHH-HS- 60 

hi I II :| I II-: I III: 

Db 468 YECNQC GKTFSQRTHLCAHQRIHTGEKPYECNECGKTFADNSALRAHHRIHTG 520 

Qy 61 YKCGLCTRSQKEGLHY TCSSHFPYSQYQFWKNF 93 

I : I I : : : I II II : I I 

Db 521 EKPYECNDCGKTFSKTSHLRAHLRTRSGEKPYECSECGKTFSEKSYVSAHQRVHTGEKPY 580 

Qy 94 QTLKIHQRVHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVH 136 

I I : : I I I : I I I I : I I : I I : I I I I : I 

Db 581 ECNVCGKPFAHNSTLRVHQRIHTGEKSYECNDCGKTF SQKSHLSAHQRIH 630 


RESULT 13 
A34612 

zinc finger protein ZNF7 - human 

N; Alternate names: zinc finger protein kox4 

C; Species: Homo sapiens (man) 

C;Date: 22-Jun-1990 #sequence_revision 22-Jun-1990 #text_change 01-Dec-2000
C;Accession: A34612; A56409; S10421; 137972 

R;Lania, L.; Donti, E. ; Pannuti, A.; Pascucci, A.; Pengue, G. ; Feliciello, I.; 
La Mantia, G.; Lanfrancone, L.; Pelicci, P.G.
Genomics 6, 333-340, 1990 


A;Title: cDNA isolation, expression analysis, and chromosomal localization of
two human zinc finger genes.

A;Reference number: A34612; MUID:90169993; PMID:2106481
A;Accession: A34612
A;Molecule type: mRNA
A;Residues: 1-686 <LAN>

A;Cross-references: GB:M29580; NID:g340445; PIDN:AAA61313.1; PID:g340446;
GB:J04751

R;Bray, P.; Lichter, P.; Thiesen, H.J.; Ward, D.C.; Dawid, I.B. 
Proc. Natl. Acad. Sci. U.S.A. 88, 9563-9567, 1991 

A;Title: Characterization and mapping of human genes encoding zinc finger
proteins.

A;Reference number: A56409; MUID:92052132; PMID:1946370

A;Accession: A56409 

A;Molecule type: DNA 

A; Residues: 425-589 <BRA> 

A; Cross-references : GB:M77170 

R;Thiesen, H.J. 

submitted to the EMBL Data Library, March 1990 

A; Reference number: S10397 

A;Accession: S10421

A; Molecule type: mRNA 

A; Residues: 413-468 <THI> 

A;Cross-references: EMBL:X52335; NID:g34165; PIDN:CAA36561.1; PID:g930095

R;Thiesen, H.J. 

New Biol. 2, 363-374, 1990 

A;Title: Multiple genes encoding zinc finger domains are expressed in human T
cells.

A;Reference number: 137949; MUID:91145339; PMID:2288909
A;Accession: 137972

A; Status: translated from GB/EMBL/DDBJ 

A; Molecule type: mRNA 

A; Residues: 413-468 <RES> 

A;Cross-references: EMBL:X52335; NID:g34165; PIDN:CAA36561.1; PID:g930095

C;Genetics:

A;Gene: GDB:ZNF7

A;Cross-references: GDB:120509; OMIM:194531
A;Map position: 8q24.3-8q24.3

C;Superfamily: zinc finger protein ZFP-36; LIM metal-binding repeat homology
C;Keywords: DNA binding; zinc finger 

Query Match 16.5%; Score 131.5; DB 2; Length 686; 

Best Local Similarity 26.9%; Pred. No. 4.8e-05; 

Matches 45; Conservative 11; Mismatches 46; Indels 65; Gaps 8; 

Qy 31 YKCGLCAAAQWDFG — NTMCQHQRVHGHHHHSYKCGLCTRSQKEGLH 75 

Ml I I II : : : I I I I : I I I : : I I ' 

Db 441 YKCNKCTKA FGCSSRLIRHQRTH-TGEKPFKCDECGKGFVQGSHLIQHQRIHTGEKP 496 

Qy 76 YTCS SHFPYSQYQFWKNFQ TLKIHQRVHGGGGSYKCG 112 

| | : II I II I I II II I I Ml 

Db 497 YVCNDCGKAFSQSSSLIYHQRIHKGEKPYECLQCGKAFSMSTQLTIHQRVHTGERPYKCN 556 


Qy 113 LCQEFFGLNN CSS SNRLDGHQRVH 136 

| : I I : II I : I I I I : I 

Db 557 ECGKAFSQNSTLFQHQIIHAGVKPYECSECGKAFSRSSYLIEHQRIH 603 


RESULT 14 
S70006 

finger protein zfOC1 - guinea pig

C; Species: Cavia porcellus (guinea pig) 

C;Date: 06-Dec-1996 #sequence_revision 13-Mar-1997 #text_change 05-Nov-1999 
C;Accession: S70006 

R;Rivolta, M.N.; Negrini, C; Wilcox, E.R. 
Biochim. Biophys. Acta 1306, 127-132, 1996 

A; Title: A novel zinc finger gene preferentially expressed in the retina and the 

organ of Corti localizes to human chromosome 12q24.3. 

A;Reference number: S70006; MUID:96221281; PMID:8634327

A; Accession: S70006 

A; Status : preliminary 

A;Molecule type: mRNA 

A; Residues: 1-261 <RIV> 

A;Cross-references: EMBL:L26335; NID:g1237277; PIDN:AAC42091.1; PID:g1237278
C; Keywords: zinc 


Query Match 16.4%; Score 131; DB 2; Length 261; 

Best Local Similarity 28.8%; Pred. No. 2.1e-05; 

Matches 42; Conservative 16; Mismatches 44; Indels 44; Gaps 7;

Qy 26 KQIAAYKCG- LCAAAQWDFGNT--MCQHQRVHGHHHHSY 61

| | : | | : I : I I :: I I I I I I '

Db 85 KPYKCYECGKAFNWSPHLQIHMRVHTGEKPYVCSECGRGFSNSSNLCMHQRVH-TGEKPF 143

Qy 62 KCGLCTRS QKEGLHYTCSSHFPYSQYQFWKNF QTLKIHQRVHGGGGSYK 110

|||:: : : I : II I : II : I I I I I I I I I :

Db 144 KCEECGKAFRHTSSLCMHQRVH TGEKPYKCYECGKAFSQSSSLCIHQRVHTGEKPYR 200

Qy 111 CGLCQEFFGLNNCSSSNRLDGHQRVH 136

I I : I I I : I I I II I

Db 201 CCGCGKAF SQSSSLCIHQRVH 221


RESULT 15 
JH0500 

zinc finger protein 29 - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 31-Mar-1992 #sequence_revision 31-Mar-1992 #text_change 08-Dec-2000 

C; Accession: JH0500 

R; Denny, P.; Ashworth, A. 

Gene 106, 221-227, 1991 

A; Title: A zinc finger protein-encoding gene expressed in the post-meiotic phase 
of spermatogenesis. 

A;Reference number: JH0500; MUID:92039080; PMID:1937051
A; Accession: JH0500 
A;Molecule type: mRNA 
A; Residues: 1-614 <DEN> 

A;Cross-references: GB:X55126; NID:g55470; PIDN:CAA38920.1; PID:g55471

A; Experimental source: testis 

C; Keywords: DNA binding; zinc finger 

F;217-244/Region: zinc finger
F;245-272/Region: zinc finger
F;273-300/Region: zinc finger
F;301-328/Region: zinc finger
F;329-356/Region: zinc finger
F;357-384/Region: zinc finger
F;385-412/Region: zinc finger
F;413-440/Region: zinc finger
F;441-468/Region: zinc finger
F;469-496/Region: zinc finger
F;497-524/Region: zinc finger
F;525-552/Region: zinc finger
F;553-580/Region: zinc finger
F;581-608/Region: zinc finger


Query Match 16.2%; Score 129.5; DB 2; Length 614;
Best Local Similarity 30.3%; Pred. No. 6.8e-05;
Matches 37; Conservative 12; Mismatches 42; Indels 31; Gaps 6;

Qy 31 YKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYKCGLCTRS QKEGLH 75

I : || I : : : I : I I Mill!: : I I

Db 418 YQCGEC-GKNFSRSSNLATHRRTH-LVEKPYKCGLCGKSFSQSSSLIAHQGTHTGEKPYE 475

Qy 76 -YTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQR 134
II |:| I I I I I I I : I I I : I I : : I I I I

Db 476 CLTCGESFSWS SNLI KHQRTHTGEKP YRCGDCGKGF SQRSQLVVHQR 522


Qy 135 VH 136 

I 

Db 523 TH 524 


Search completed: September 17, 2004, 16:49:56 
Job time : 23.4542 secs


GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 


OM protein - protein search, using sw model 
Run on: September 17, 2004, 16:48:23; Search time 83.7356 Seconds
(without alignments)
529.242 Million cell updates/sec

Title: US-10-057-890A-10
Perfect score: 797
Sequence: 1 MDYQVSSPIYDINYYTSEPC......GLNNCSSSNRLDGHQRVHAA 138

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5


Searched: 1342398 seqs, 321133274 residues

Total number of hits satisfying chosen parameters: 1342398

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 
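
The throughput figure in the header can be cross-checked against the other header values. The Python sketch below assumes (as an inference from the numbers, not from vendor documentation) that one "cell update" is one dynamic-programming cell, so the total is the query length (138 residues) times the number of database residues searched, divided by the search time.

```python
# Values copied from the search header of this report.
query_length = 138            # residues in query US-10-057-890A-10
db_residues = 321_133_274     # residues searched
search_seconds = 83.7356      # search time without alignments

# Assumed definition: one cell per (query residue, database residue) pair.
cells = query_length * db_residues
rate_millions = cells / search_seconds / 1e6
print(f"{rate_millions:.3f} million cell updates/sec")
```

Under that assumption the result agrees with the reported 529.242 million cell updates/sec to three decimal places.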


Database : Published_Applications_AA: * 

1: /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep:*
2: /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep:*
3: /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB.pep:*
4: /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep:*
5: /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep:*
6: /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep:*
7: /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep:*
8: /cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB.pep:*
9: /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep:*
10: /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep:*
11: /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep:*
12: /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep:*
13: /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep:*
14: /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep:*
15: /cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB.pep:*
16: /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep:*
17: /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep:*
18: /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep:*


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 


Result            Query
   No.   Score   Match  Length  DB  ID
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14 
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352 

14 
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Sequence 2, Appli 
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5 

352 

13 

US-10-106-623-20 

Sequence 20, Appl 

    41     359   45.0     352  12  US-10-151-274-5   Sequence 5, Appli
    42     258   32.4     184   9  US-09-938-719-4   Sequence 4, Appli
    43     258   32.4           9  US-09-939-226-4   Sequence 4, Appli
    44     258   32.4     184   9  US-09-938-703-4   Sequence 4, Appli
    45     258   32.4     184  16  US-10-661-798-4   Sequence 4, Appli


ALIGNMENTS 


RESULT 1 

US-10-057-890A-10 

; Sequence 10, Application US/10057890A 
; Publication No. US20030044901A1 
; GENERAL INFORMATION:

APPLICANT: Coleman, Timothy 


; APPLICANT: Mansfield, Brian 

TITLE OF INVENTION: Scaffold Fusion Polypeptides, Composition for Making the 
Same, and Methods 

TITLE OF INVENTION: of Using the Same. 
FILE REFERENCE: PF537 

CURRENT APPLICATION NUMBER: US/10/057,890A
CURRENT FILING DATE: 2002-01-29 
PRIOR APPLICATION NUMBER: 60/265,782 
PRIOR FILING DATE: 2001-01-31 
PRIOR APPLICATION NUMBER: 60/265,858 
PRIOR FILING DATE: 2001-01-31 
NUMBER OF SEQ ID NOS: 32
SEQ ID NO 10 
LENGTH: 138 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-057-890A-10 

Query Match 100.0%; Score 797; DB 14; Length 138; 

Best Local Similarity 100.0%; Pred. No. 5.6e-71; 

Matches 138; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
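
Each hit in this report carries the same three-line block of semicolon-separated statistics. For readers who want to tabulate them, here is a minimal Python sketch of one way to parse such a block. The function name `parse_stats` and the regular expression are illustrative assumptions, not part of any GenCore tooling, and the pattern only targets clean lines of the form shown above.

```python
import re

# A label (letters, dots, spaces), whitespace, a number, an optional "%", then ";".
FIELD = re.compile(r"([A-Za-z][A-Za-z. ]*?)\s+([0-9][0-9.eE+-]*)%?;")


def parse_stats(block):
    """Return {label: float} for every 'Label value;' pair found in the block."""
    return {m.group(1): float(m.group(2)) for m in FIELD.finditer(block)}


# The statistics block of RESULT 1, copied from the report.
block = """Query Match 100.0%; Score 797; DB 14; Length 138;
Best Local Similarity 100.0%; Pred. No. 5.6e-71;
Matches 138; Conservative 0; Mismatches 0; Indels 0; Gaps 0;"""

stats = parse_stats(block)
print(stats["Score"], stats["Pred. No."], stats["Matches"])
```

All values are returned as floats for simplicity; OCR-damaged lines (for example a score printed as "37 9") would need cleaning before parsing.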

Qy   1 MDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHS 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHS 60

Qy  61 YKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGL 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 YKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGL 120

Qy 121 NNCSSSNRLDGHQRVHAA 138
       ||||||||||||||||||
Db 121 NNCSSSNRLDGHQRVHAA 138


RESULT 2 

US-10-057-890A-31 

; Sequence 31, Application US/10057890A 

; Publication No. US20030044901A1 

; GENERAL INFORMATION: 

; APPLICANT: Coleman, Timothy 

; APPLICANT: Mansfield, Brian 

; TITLE OF INVENTION: Scaffold Fusion Polypeptides, Composition for Making the 
Same, and Methods 

; TITLE OF INVENTION: of Using the Same. 

FILE REFERENCE: PF537 
; CURRENT APPLICATION NUMBER: US/10/057,890A

CURRENT FILING DATE: 2002-01-29 
; PRIOR APPLICATION NUMBER: 60/265,782 
; PRIOR FILING DATE: 2001-01-31 
; PRIOR APPLICATION NUMBER: 60/265,858 
; PRIOR FILING DATE: 2001-01-31 
; NUMBER OF SEQ ID NOS: 32 
; SEQ ID NO 31 
LENGTH: 157 
TYPE: PRT 
; ORGANISM: Homo sapiens 


US-10-057-890A-31 


Query Match 100.0%; Score 797; DB 14; Length 157;

Best Local Similarity 100.0%; Pred. No. 6.4e-71;
Matches 138; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHS 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  20 MDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHS 79

Qy  61 YKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGL 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  80 YKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGL 139

Qy 121 NNCSSSNRLDGHQRVHAA 138
       ||||||||||||||||||
Db 140 NNCSSSNRLDGHQRVHAA 157


RESULT 3 
US-09-725-285-2 

Sequence 2, Application US/09725285 
Patent No. US20010000241A1 
GENERAL INFORMATION: 
APPLICANT: Li, Yi 
APPLICANT: Ruben, Steven, M. 

TITLE OF INVENTION: Antibodies to Human G-Protein Chemokine Receptor 
HDGNR10 

TITLE OF INVENTION: (CCR5 Receptor) 
FILE REFERENCE: 1488.1150003 
CURRENT APPLICATION NUMBER: US/09/725,285
CURRENT FILING DATE: 2000-11-29 
PRIOR APPLICATION NUMBER: 09/339,912 
PRIOR FILING DATE: 1999-06-25 
PRIOR APPLICATION NUMBER: 09/195,662 
PRIOR FILING DATE: 1998-11-18 
PRIOR APPLICATION NUMBER: 08/466,343 
PRIOR FILING DATE: 1995-06-06 
NUMBER OF SEQ ID NOS: 9
SOFTWARE: PatentIn version 3.0
SEQ ID NO 2 
LENGTH: 352 
TYPE: PRT 

ORGANISM: Artificial Sequence: Genomic 
FEATURE : 

OTHER INFORMATION: Deduced Amino Acid Sequence 
US-09-725-285-2 

Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7;

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60

Qy 31 -YKCGLC- -AAAQWDFGNTMCQHQRVHGHHHHSYKCG- 64

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78

: I I I I I I I I I I I I

Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105

I I I I I I I I I I I I I I I I I I I I : : I

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135

I I I I I I I I I I I I I I I I I I : I

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281


RESULT 4 
US-09-759-841-2 

; Sequence 2, Application US/09759841 

; Patent No. US20010039026A1 

; GENERAL INFORMATION: 

; APPLICANT: Rickett, Graham A 

; APPLICANT: Dobbs, Susan 

; APPLICANT: Perros, Manoussos 

; TITLE OF INVENTION: Assay Method 

; FILE REFERENCE: PC10348APME 

; CURRENT APPLICATION NUMBER: US/09/759,841

CURRENT FILING DATE: 2001-01-12 
; PRIOR APPLICATION NUMBER: GB 0000661.9 
; PRIOR FILING DATE: 2000-01-12 
; PRIOR APPLICATION NUMBER: GB 0000663.5 
; PRIOR FILING DATE: 2000-01-12 
; PRIOR APPLICATION NUMBER: GB 0000659.3 
; PRIOR FILING DATE: 2000-01-12 
; NUMBER OF SEQ ID NOS: 6
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 2 

LENGTH: 352 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-759-841-2 


Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHS YKCG 64 

I I I I I I I I I I II II I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118


Qy 65 LCTRSQKEGLHYTC 78

: I 1 I I I I I I I I I I 

Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178


Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I M I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 5 

US-09-779-879A-22 

Sequence 22, Application US/09779879A 
Patent No. US20020048786A1
GENERAL INFORMATION: 
APPLICANT: Rosen, Craig A. 
APPLICANT: Roschke, Viktor 
APPLICANT: Li, Yi 
APPLICANT: Ruben, Steven, M. 

TITLE OF INVENTION: Human G-protein Chemokine Receptor (CCR5) HDGNR10 
FILE REFERENCE: 1488.115000A 

CURRENT APPLICATION NUMBER: US/09/779,879A
CURRENT FILING DATE: 2001-02-09 
PRIOR APPLICATION NUMBER: US 60/181,258 
PRIOR FILING DATE: 2000-02-09 
PRIOR APPLICATION NUMBER: US 60/187,999 
PRIOR FILING DATE: 2000-03-09 
PRIOR APPLICATION NUMBER: US 60/234,336 
PRIOR FILING DATE: 2000-09-22 
NUMBER OF SEQ ID NOS: 58
SOFTWARE: PatentIn version 3.0
SEQ ID NO 22 
LENGTH: 352 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-779-879A-22 

Query Match 47.6%; Score 379; DB 9; Length 352;

Best Local Similarity 35.0%; Pred. No. 3e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30

I I I I I I I I I I I I I I I I I 11 I I I I M I M I I

Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60

Qy 31 YKCGLC -AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64

I I I I I I I I I I II I I I : I : : I

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78

: I I I I I I I I I I I I

Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105

II I II I II I II I II I I I I I I : : I

Db 179


Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I II I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 6 

US-09-779-880A-22 

Sequence 22, Application US/09779880A 
Patent No. US20020061834A1 
GENERAL INFORMATION: 
APPLICANT: Rosen, Craig A. 
APPLICANT: Roschke, Viktor 
APPLICANT: Li, Yi 
APPLICANT: Ruben, Steven, M. 

TITLE OF INVENTION: Human G-protein Chemokine Receptor (CCR5) HDGNR10 
FILE REFERENCE: 1488.115000C 

CURRENT APPLICATION NUMBER: US/09/779,880A
CURRENT FILING DATE: 2001-02-09 
PRIOR APPLICATION NUMBER: US 60/181,258 
PRIOR FILING DATE: 2000-02-09 
PRIOR APPLICATION NUMBER: US 60/187,999 
PRIOR FILING DATE: 2000-03-09 
PRIOR APPLICATION NUMBER: US 60/234,336 
PRIOR FILING DATE: 2000-09-22 
NUMBER OF SEQ ID NOS: 58
SOFTWARE: PatentIn version 3.0
SEQ ID NO 22 
LENGTH: 352 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-779-880A-22 

Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I

MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60

YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 


Qy 

1 

Db 

1 

Qy 

31 

Db 

61 

Qy 

65 

Db 

119 

Qy 

79 

Db 

179 

Qy 

106 

Db 

239 


-LCTRSQKEGLHYTC 78 
: I I I I I I I I I I I I 


SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I II I I I I I I II II I I I I = : I 


I I I I I I I I I I II I I I I : I 


RESULT 7 

US-09-813-653-15 

Sequence 15, Application US/09813653 
Patent No. US20020064770A1 
GENERAL INFORMATION: 
APPLICANT: Nestor, John 
APPLICANT: Wilson, Carol 
APPLICANT: See, Raymond 
APPLICANT: Tan Hehir, Christina 

TITLE OF INVENTION: Binding Compounds and Methods For Identifying Binding 
Compounds 

FILE REFERENCE: CNS-005 

CURRENT APPLICATION NUMBER: US/09/813,653
CURRENT FILING DATE: 2001-03-20 
PRIOR APPLICATION NUMBER: US 60/190,946 
PRIOR FILING DATE: 2000-03-21 
PRIOR APPLICATION NUMBER: US 60/190,996 
PRIOR FILING DATE: 2000-03-21 
PRIOR APPLICATION NUMBER: US 60/191,299 
PRIOR FILING DATE: 2000-03-21 
NUMBER OF SEQ ID NOS: 44
SOFTWARE: PatentIn version 3.0
SEQ ID NO 15 
LENGTH: 352 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-813-653-15 

Query Match 47.6%; Score 379; DB 9; Length 352;

Best Local Similarity 35.0%; Pred. No. 3e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

II I I I I I II I I I II I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78

: I I II I I I I I I I I 

Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I II I I II I I II I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 8 

US-09-813-653-17 

; Sequence 17, Application US/09813653 
; Patent No. US20020064770A1


; GENERAL INFORMATION: 

; APPLICANT: Nestor, John 

; APPLICANT: Wilson, Carol 

; APPLICANT: See, Raymond 

; APPLICANT: Tan Hehir, Christina 

; TITLE OF INVENTION: Binding Compounds and Methods For Identifying Binding 
Compounds 

; FILE REFERENCE: CNS-005 

; CURRENT APPLICATION NUMBER: US/09/813,653

; CURRENT FILING DATE: 2001-03-20 

; PRIOR APPLICATION NUMBER: US 60/190,946 

; PRIOR FILING DATE: 2000-03-21 

; PRIOR APPLICATION NUMBER: US 60/190,996 

; PRIOR FILING DATE: 2000-03-21 

; PRIOR APPLICATION NUMBER: US 60/191,299 

; PRIOR FILING DATE: 2000-03-21 

; NUMBER OF SEQ ID NOS: 44

; SOFTWARE: PatentIn version 3.0

; SEQ ID NO 17 

LENGTH: 352 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-813-653-17 

Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVTLIQINCKR 60

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I II I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78 

: I I II I I I I I I I I 

Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I II I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 9 
US-09-796-202-1 

; Sequence 1, Application US/09796202 

; Patent No. US20020068813A1 

; GENERAL INFORMATION: 

; APPLICANT: Dragic, Tatjana 

; APPLICANT: Olson, William 

; TITLE OF INVENTION: SULFATED CCR5 PEPTIDES FOR HIV-1 INFECTION 


; FILE REFERENCE: 2048/61010/JPW/SHS 

; CURRENT APPLICATION NUMBER: US/09/796,202 

; CURRENT FILING DATE: 2001-02-28 

; NUMBER OF SEQ ID NOS : 17 

; SOFTWARE: PatentIn version 3.0 

; SEQ ID NO 1 

LENGTH: 352 
; TYPE: PRT 

ORGANISM: human 
US-09-796-202-1 


Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

II I I I I I I I I I I I I I : I :: I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 


Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I M II I I I I I II I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I M I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 10 
US-09-195-662A-2 

; Sequence 2, Application US/09195662A 

; Patent No. US20020076745A1 

; GENERAL INFORMATION: 

; APPLICANT: Li, Yi 

; APPLICANT: Ruben, Steven, M. 

TITLE OF INVENTION: Human G-Protein Chemokine Receptor HDGNR10 (CCR5 
Receptor) 

; FILE REFERENCE: 1488.1150002 

; CURRENT APPLICATION NUMBER: US/09/195,662A 

CURRENT FILING DATE: 1998-11-18 
; PRIOR APPLICATION NUMBER: 08/466,343 
; PRIOR FILING DATE: 1995-06-06 
; NUMBER OF SEQ ID NOS: 9 

SOFTWARE: PatentIn version 3.0 
; SEQ ID NO 2 
; LENGTH: 352 
TYPE: PRT 

; ORGANISM: Artificial Sequence: Genomic 
FEATURE : 


OTHER INFORMATION: Deduced Amino Acid Sequence 
US-09-195-662A-2 


Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I II I I I I I I I I I I I I I I I I I I I I I I I II i I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I I I I I II I I 

Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I II I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 11 
US-09-339-912A-2 

; Sequence 2, Application US/09339912A 

; Patent No. US20020099176A1 

; GENERAL INFORMATION: 

; APPLICANT: Li, Yi 

; APPLICANT: Ruben, Steven, M. 

TITLE OF INVENTION: Antibodies to Human G-Protein Chemokine Receptor 
HDGNR10 

; TITLE OF INVENTION: (CCR5 Receptor) 

; FILE REFERENCE: 1488.1150003 

; CURRENT APPLICATION NUMBER: US/09/339,912A 

; CURRENT FILING DATE: 1999-06-25 

; PRIOR APPLICATION NUMBER: 09/195,662 

; PRIOR FILING DATE: 1998-11-18 

; PRIOR APPLICATION NUMBER: 08/466,343 

; PRIOR FILING DATE: 1995-06-06 

; NUMBER OF SEQ ID NOS : 9 

SOFTWARE: PatentIn version 3.0 
; SEQ ID NO 2 
; LENGTH: 352 
TYPE: PRT 

; ORGANISM: Artificial Sequence: Genomic 
; FEATURE : 

; OTHER INFORMATION: Deduced Amino Acid Sequence 
US-09-339-912A-2 


Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3e-29; 


Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 


Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I II II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

II I I I I I I I I I I I I I : I :: I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I I I I I I II I 

Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 12 
US-09-938-719-5 

Sequence 5, Application US/09938719 
Patent No. US20020106742A1 
GENERAL INFORMATION : 

APPLICANT: SAMSON, MICHEL 

PARMENTIER, MARC 
VASSART, GILBERT 
LIBERT, FREDERICK 
TITLE OF INVENTION: ACTIVE AND INACTIVE CC-CHEMOKINES RECEPTOR 

AND NUCLEIC ACID MOLECULES ENCODING SAID RECEPTOR 
NUMBER OF SEQUENCES: 17 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Knobbe, Martens, Olson & Bear 
STREET: 620 Newport Center Drive 16th Floor 
CITY: Newport Beach 
STATE: CA 
COUNTRY: U.S.A. 
ZIP: 92660 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/938,719 
FILING DATE: 24-Aug-2001 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/626,939 
FILING DATE: 27-JULY-2000 
ATTORNEY/AGENT INFORMATION: 
NAME: Altman, Daniel E 
REGISTRATION NUMBER: 34,115 


; REFERENCE/DOCKET NUMBER: <Unknown> 

; INFORMATION FOR SEQ ID NO: 5: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 352 amino acids 

; TYPE: amino acid 

; TOPOLOGY: linear 

; MOLECULE TYPE: protein 

SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-938-719-5 

Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

II I I I I I I I I II I I I : I :: I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I II I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 13 
US-09-939-226-5 

; Sequence 5, Application US/09939226 
; Patent No. US20020110805A1 
GENERAL INFORMATION: 

APPLICANT: SAMSON, MICHEL 

PARMENTIER, MARC 
VASSART, GILBERT 
LIBERT, FREDERICK 
TITLE OF INVENTION: ACTIVE AND INACTIVE CC-CHEMOKINES RECEPTOR 
; AND NUCLEIC ACID MOLECULES ENCODING SAID RECEPTOR 

NUMBER OF SEQUENCES: 17 
; CORRESPONDENCE ADDRESS: 

ADDRESSEE: Knobbe, Martens, Olson & Bear 
STREET: 620 Newport Center Drive 16th Floor 
; CITY: Newport Beach 

STATE: CA 
COUNTRY: U.S.A. 
ZIP: 92660 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 


OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/939,226 
FILING DATE: 24-Aug-2001 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/626,939 
FILING DATE: 2000-07-27 
ATTORNEY/AGENT INFORMATION: 
NAME: Altman, Daniel E 
REGISTRATION NUMBER: 34,115 
REFERENCE/DOCKET NUMBER: <Unknown> 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 352 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-939-226-5 

Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I II I I I I I I I I I I I I I II I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I M I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I I II I I I I I 

Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I II I I I II I I I I I I I I I I I I 
Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 14 
US-09-938-703-5 

; Sequence 5, Application US/09938703 
; Patent No. US20020110870A1 
GENERAL INFORMATION: 

APPLICANT: SAMSON, MICHEL 

PARMENTIER, MARC 
; VASSART, GILBERT 

LIBERT, FREDERICK 
TITLE OF INVENTION: ACTIVE AND INACTIVE CC-CHEMOKINES RECEPTOR 


AND NUCLEIC ACID MOLECULES ENCODING SAID RECEPTOR 
NUMBER OF SEQUENCES: 17 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Knobbe, Martens, Olson & Bear 
STREET: 620 Newport Center Drive 16th Floor 
CITY: Newport Beach 
STATE: CA 
COUNTRY: U.S.A. 
ZIP: 92660 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/938,703 
FILING DATE: 24-Aug-2001 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/626,939 
FILING DATE: 2000-07-27 
ATTORNEY/AGENT INFORMATION: 
NAME: Altman, Daniel E 
REGISTRATION NUMBER: 34,115 
REFERENCE/DOCKET NUMBER: <Unknown> 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 352 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-938-703-5 

Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I I I I I I I I I II I I 1 I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

II I I I I II I I I I I I I : I :: I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I I I I I I I II 

Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I II I I I I I II I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I II I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 15 
US-09-502-783A-2 

; Sequence 2, Application US/09502783A 

; Patent No. US20020132269A1 

; GENERAL INFORMATION: 

; APPLICANT: Li, Yi 

; APPLICANT: Ruben, Steven M. 

; TITLE OF INVENTION: Polynucleotides Encoding Human G-Protein Chemokine 
Receptor (CCR5) 

; TITLE OF INVENTION: HDGNR10 
; FILE REFERENCE: 1488.1150006 

; CURRENT APPLICATION NUMBER: US/09/502,783A 

; CURRENT FILING DATE: 2001-08-23 

; PRIOR APPLICATION NUMBER: 08/466,343 

; PRIOR FILING DATE: 1995-06-06 

; NUMBER OF SEQ ID NOS: 9 

; SOFTWARE: PatentIn version 3.0 

; SEQ ID NO 2 

LENGTH: 352 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-502-783A-2 

Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I II I I I I I I 

Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 

Db 



Search completed: September 17, 2004, 17:12:52 
Job time : 86.7356 secs 


GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 


OM protein - protein search, using sw model 

Run on: September 17, 2004, 16:42:06 ; Search time 66.8949 Seconds 

(without alignments) 
650.894 Million cell updates/sec 


Title: US-10-057-890A-10 

Perfect score: 797 

Sequence: 1 MDYQVSSPIYDINYYTSEPC GLNNCSSSNRLDGHQRVHAA 138 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 


Searched: 1017041 seqs, 315518202 residues 

Total number of hits satisfying chosen parameters: 1017041 


Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 


Database : SPTREMBL_25:* 
1: sp_archea:* 
2: sp_bacteria:* 
3: sp_fungi:* 
4: sp_human:* 
5: sp_invertebrate:* 
6: sp_mammal:* 
7: sp_mhc:* 
8: sp_organelle:* 
9: sp_phage:* 
10: sp_plant:* 
11: sp_rodent:* 
12: sp_virus:* 
13: sp_vertebrate:* 
14: sp_unclassified:* 
15: sp_rvirus:* 
16: sp_bacteriap:* 
17: sp_archeap:* 


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
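
The other per-result figures in this report (Query Match and Best Local Similarity) appear to be simple ratios of numbers printed alongside them. The sketch below is illustrative only: the function names are hypothetical, and the two formulas are assumptions inferred from the values in this listing, not taken from GenCore documentation. It assumes Query Match is the hit score divided by the query's Perfect score (797 here), and Best Local Similarity is Matches divided by the total number of alignment columns (Matches + Conservative + Mismatches + Indels).

```python
# Hypothetical helpers; the formulas are assumptions inferred from this report.

def query_match_pct(score, perfect_score):
    """Hit score as a percentage of the query's perfect (self) score."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity_pct(matches, conservative, mismatches, indels):
    """Identical positions as a percentage of all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


PERFECT_SCORE = 797  # "Perfect score" from the search header

# Figures reported for the hits scoring 379 in this listing
print(query_match_pct(379, PERFECT_SCORE))          # 47.6
print(best_local_similarity_pct(99, 7, 27, 150))    # 35.0
```

Under these assumptions the same two functions also reproduce the 46.4% and 34.9% printed for the hits scoring 370 (Matches 96; Conservative 9; Mismatches 28; Indels 142).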


SUMMARIES 

                   % 
Result           Query 
   No.  Score    Match  Length  DB  ID       Description 

     1    373     46.8     352   6  O18772   O18772 pan troglod 
     2    370     46.4     344   6  Q9TQR8   Q9tqr8 cercocebus 
     3    370     46.4     344   6  O77833   O77833 cercocebus 
     4    369     46.3     352   6  Q95NC5   Q95nc5 hylobates s 
     5    369     46.3     352   6  Q9TV50   Q9tv50 pan troglod 
     6    367     46.0     352   6  Q95NC3   Q95nc3 miopithecus 
     7    367     46.0     352   6  O18771   O18771 pan troglod 
     8    366     45.9     352   6  Q9TSK1   Q9tsk1 cercopithec 
     9    366     45.9     352   6  Q9TV49   Q9tv49 cercocebus 
    10    366     45.9     352   6  Q9XT13   Q9xt13 papio anubi 
    11    366     45.9     352   6  Q9TV45   Q9tv45 cercopithec 
    12    366     45.9     352   6  Q95NE8   Q95ne8 cercopithec 
    13    365     45.8     352   6  Q9XT14   Q9xt14 colobus gue 
    14    365     45.8     352   6  Q95NC6   Q95nc6 trachypithe 
    15    365     45.8     352   6  Q95NC8   Q95nc8 colobus pol 
    16    364     45.7     352   6  Q9XS99   Q9xs99 gorilla gor 
    17    363     45.5     352   6  O97975   O97975 macaca arct 
    18    363     45.5     352   6  Q9MZA2   Q9mza2 cercopithec 
    19    362     45.4     352   6  Q95NC1   Q95nc1 theropithec 
    20    362     45.4     352   6  Q95NC0   Q95nc0 hylobates m 
    21    362     45.4     352   6  Q9TV46   Q9tv46 cercopithec 
    22    362     45.4     352   6  O97962   O97962 pygathrix a 
    23    361     45.3     352   6  Q95NC7   Q95nc7 nasalis lar 
    24    361     45.3     352   6  Q9TV42   Q9tv42 cercopithec 
    25    361     45.3     352   6  O77776   O77776 cercocebus 
    26    361     45.3     352   6  Q95NE1   Q95ne1 cercocebus 
    27    361     45.3     352   6  Q9TQX0   Q9tqx0 cercopithec 
    28                     352   6  Q9TV43   Q9tv43 cercopithec 
    29    358     44.9     352   6  O18770   O18770 pan troglod 
    30    357     44.8     352   6  Q9TV47   Q9tv47 cercopithec 
    31    357     44.8     352   6  Q9BGN5   Q9bgn5 cercopithec 
    32    357     44.8     352   6  Q9XT12   Q9xt12 cercopithec 
    33    357     44.8     352   6  Q9TSQ7   Q9tsq7 cercopithec 
    34    356     44.7     352   6  Q95ND2   Q95nd2 mandrillus 
    35    356     44.7     352   6  Q95ND1   Q95nd1 mandrillus 
    36    355     44.5     352   6  Q9XT76   Q9xt76 cercopithec 
    37    355     44.5     352   6  Q9TV44   Q9tv44 cercopithec 
    38    355     44.5     352   6  Q9XS35   Q9xs35 macaca neme 
    39    355     44.5     352   6  Q9MZA3   Q9mza3 hylobates a 
    40    355     44.5     352   6  Q9TV93   Q9tv93 macaca arct 
    41    355     44.5     352   6  Q95ND0   Q95nd0 erythrocebu 
    42    346     43.4     352   6  Q9BGN6   Q9bgn6 cercopithec 
    43    345     43.3     339   4  Q9UN25   Q9un25 homo sapien 
    44    344     43.2     339   4  Q9UN23   Q9un23 homo sapien 
    45    344     43.2     339   4  Q9UN27   Q9un27 homo sapien 


ALIGNMENTS 


RESULT 1 
O18772 
ID O18772 PRELIMINARY; PRT; 352 AA. 
AC O18772; 
DT 01-JAN-1998 (TrEMBLrel. 05, Created) 
DT 01-JAN-1998 (TrEMBLrel. 05, Last sequence update) 
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 
DE CCR5 receptor (Fragment). 
GN CCR5. 
OS Pan troglodytes (Chimpanzee). 
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Pan. 
OX NCBI_TaxID=9598; 
RN [1] 
RP SEQUENCE FROM N.A. 
RC STRAIN=ChCCR5-142a; 
RA Zhang L., Carruthers C.D., He T., Huang Y., Cao Y., Wang G., Hahn B., 
RA Ho D.D.; 
RT "HIV-1 subtypes, co-receptor usage, and CCR5 polymorphism."; 
RL AIDS Res. Hum. Retroviruses 0:0-0(1997). 
DR EMBL; AF011541; AAB65741.1; -. 
DR GO; GO:0016021; C:integral to membrane; IEA. 
DR GO; GO:0004872; F:receptor activity; IEA. 
DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA. 
DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA. 
DR InterPro; IPR000276; GPCR_Rhodpsn. 
DR Pfam; PF00001; 7tm_1; 1. 
DR PRINTS; PR00237; GPCRRHODOPSN. 
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1. 
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1. 
KW Receptor. 
FT NON_TER 352 352 
SQ SEQUENCE 352 AA; 40598 MW; 39688FA7004C952F CRC64; 

Query Match 46.8%; Score 373; DB 6; Length 352; 
Best Local Similarity 34.6%; Pred. No. 1.9e-33; 
Matches 98; Conservative 8; Mismatches 27; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 
I I I I I I I I I I I I : I I I I I ! I I I I I I I I M I 
Db 1 MDYQVSSPIYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVDNMLVILILINCKR 60 

Qy 31 
I | I I I I II I I I I I I I : I : : I 
Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 -LCTRSQKEGLHYTC 78 
: I I I I I II I I I I I 
Db 119 

Qy 79 -HQRVHGG- 105 
I I I I I I I II I II I I I I I II I 
Db 179 

Qy 106 -GGSYKCGLC- -QEFFGLNNCSSSNRLDGHQRV 135 
I I I I I I I I I I I 1 I I I I I I : I 
Db 239 


RESULT 2 
Q9TQR8 
ID Q9TQR8 PRELIMINARY; PRT; 344 AA. 
AC Q9TQR8; 
DT 01-MAY-2000 (TrEMBLrel. 13, Created) 
DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 
DE Chemokine receptor CCR5. 
GN CCR5. 
OS Cercocebus torquatus torquatus, and 
OS Cercocebus torquatus atys (Red-crowned mangabey) (Sooty mangabey). 
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 
OC Cercopithecinae; Cercocebus. 
OX NCBI_TaxID=81944, 9531; 
RN [1] 
RP SEQUENCE FROM N.A. 
RC SPECIES=C. torquatus torquatus; STRAIN=1049, and 997; 
RA Chen Z., Kwon D., Jin Z., Monard S., Telfer P., Jones M., Lu C., 
RA Aguilar R., Ho D.D., Marx P.A.; 
RT "Natural infection of a homozygous delta 24 CCR5 red-capped mangabey 
RT with a R2b-tropic simian immunodeficiency virus."; 
RL J. Exp. Med. 0:0-0(1998). 
RN [2] 
RP SEQUENCE FROM N.A. 
RC SPECIES=C. torquatus atys; 
RA Palacios E., Digilio L., McClure H.M., Chen Z., Marx P.A., 
RA Goldsmith M.A., Grant R.M.; 
RT "Parallel evolution of CCR5-null phenotypes in humans and in a natural 
RT host of simian immunodeficiency virus."; 
RL Curr. Biol. 0:0-0(1998). 
RN [3] 
RP SEQUENCE FROM N.A. 
RC SPECIES=C. torquatus torquatus; STRAIN=RCM411; 
RA Beer B.E., Kuiken C.L., Tooze Z., Foley B.T., Goeken R.M., Brown C.R., 
RA St Claire M., Hirsch V.M.; 
RT "Characterization of novel simian immunodeficiency viruses from 
RT redcapped mangabeys from Nigeria (SIVrcmNgM411 and NgD409)."; 
RL Submitted (FEB-2001) to the EMBL/GenBank/DDBJ databases. 
DR EMBL; AF094753; AAC62474.1; -. 
DR EMBL; AF079473; AAC31194.1; -. 
DR EMBL; AF084003; AAC62471.1; -. 
DR EMBL; AF349683; AAK69685.1; -. 
DR GO; GO:0016021; C:integral to membrane; IEA. 
DR GO; GO:0004872; F:receptor activity; IEA. 
DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA. 
DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA. 
DR InterPro; IPR000276; GPCR_Rhodpsn. 
DR Pfam; PF00001; 7tm_1; 1. 
DR PRINTS; PR00237; GPCRRHODOPSN. 
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1. 
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1. 
KW Receptor. 
SQ SEQUENCE 344 AA; 39592 MW; E15F5F601191A4D1 CRC64; 

Query Match 46.4%; Score 370; DB 6; Length 344; 
Best Local Similarity 34.9%; Pred. No. 4e-33; 
Matches 96; Conservative 9; Mismatches 28; Indels 142; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 
M I I I I I I I I I : I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILVVLILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 
I I I I I I I I I I I I I I I : I : : I 
Db 61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTCSSHFPYSQ 86 
: I I I I : II I I I I I I I I I I I I I 
Db 119 IILLTIDRYLAIVHAVFALKARTVTFGVVAVFASLPGIIFTRSQREGLHYTCSSHFPYSQ 178 

Qy 87 YQFWKNFQTLKI HQRVHGG 105 
I M I I I I I I I I I : : I 
Db 179 YQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIFTIMIVYFL 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 
I I I I I I I II I I I I I I I I I : I 
Db 239 FWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 273 


RESULT 3 
O77833 
ID O77833 PRELIMINARY; PRT; 344 AA. 
AC O77833; 
DT 01-NOV-1998 (TrEMBLrel. 08, Created) 
DT 01-NOV-1998 (TrEMBLrel. 08, Last sequence update) 
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 
DE Chemokine receptor CCR5. 
GN CCR5. 
OS Cercocebus torquatus torquatus. 
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 
OC Cercopithecinae; Cercocebus. 
OX NCBI_TaxID=81944; 
RN [1] 
RP SEQUENCE FROM N.A. 
RC STRAIN=1208, and 009; 
RA Chen Z., Kwon D., Jin Z., Monard S., Telfer P., Jones M., Lu C., 
RA Aguilar R., Ho D.D., Marx P.A.; 
RT "Natural infection of a homozygous delta 24 CCR5 red-capped mangabey 
RT with a R2b-tropic simian immunodeficiency virus."; 
RL J. Exp. Med. 0:0-0(1998). 
DR EMBL; AF094752; AAC62473.1; -. 
DR EMBL; AF084002; AAC62470.1; -. 
DR GO; GO:0016021; C:integral to membrane; IEA. 
DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA. 
DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA. 
DR InterPro; IPR000276; GPCR_Rhodpsn. 
DR Pfam; PF00001; 7tm_1; 1. 
DR PRINTS; PR00237; GPCRRHODOPSN. 
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1. 
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1. 
SQ SEQUENCE 344 AA; 39578 MW; 0CEC05E47C2F6DE6 CRC64; 

Query Match 46.4%; Score 370; DB 6; Length 344; 
Best Local Similarity 34.9%; Pred. No. 4e-33; 
Matches 96; Conservative 9; Mismatches 28; Indels 142; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 
M I I I I I I I II : I I I I I I I I I I I I I I I 
Db 1 

Qy 31 
1 1 1 1 1 1 1 1 1 1 1 1 I 
Db 61 

Qy 65 -LCTRSQKEGLHYTCSSHFPYSQ 86 
: I I I I : I I I I I I I I I I I I I I I 
Db 119 

Qy 87 YQFWKNFQTLKI HQRVHGG 105 
I I II I I I I I I 1 I : : I 
Db 179 

Qy 106 -GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 
I I 1 I I I I I I I I I I I I I 11 : I 
Db 239 FWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 273 


RESULT 4 
Q95NC5 

ID Q95NC5 PRELIMINARY; PRT; 352 AA. 

AC Q95NC5; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE C-C chemokine receptor 5. 

GN CCR5. 

OS Hylobates syndactylus (Siamang) (Symphalangus syndactylus). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hylobatidae; Hylobates. 

OX NCBI_TaxID=9590; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Zhang Y., Ryder O.A., Zhang Y.; 

RT "Sequence comparison of the CCR5 gene in primates and primate 

RT phylogeny."; 

RL Submitted (AUG-1999) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF177884; AAK43367.1; -. 

DR GO; GO:0016021; C:integral to membrane; IEA. 

DR GO; GO:0004872; F:receptor activity; IEA. 

DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA. 

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn. 

DR Pfam; PF00001; 7tm_1; 1. 

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1. 

KW Receptor. 

SQ SEQUENCE 352 AA; 40508 MW; F4F64B3AD5AF658A CRC64; 

Query Match 46.3%; Score 369; DB 6; Length 352; 

Best Local Similarity 34.3%; Pred. No. 5.3e-33; 

Matches 97; Conservative 8; Mismatches 28; Indels 150; Gaps 


Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I II I I I I : I I I I I I I I I I I I I I I I I 


Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 


Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

| | I I I I I II I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAIVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I II I I I i I I I ! I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 5 
Q9TV50 

ID Q9TV50 PRELIMINARY; PRT; 352 AA. 

AC Q9TV50; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CC chemokine receptor type 5. 

GN CCR5. 

OS Pan troglodytes (Chimpanzee). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Pan. 

OX NCBI_TaxID=9598; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=1410; 

RX MEDLINE=99335215; PubMed=10408730; 

RA Muller-Trutwin M.C., Corbet S., Hansen J., Georges-Courbot M.C., 

RA Diop O., Rigoulet J., Barre-Sinoussi F., Fomsgaard A.; 

RT "Mutations in CCR5-coding sequences are not associated with SIV 

RT carrier status in African nonhuman primates."; 

RL AIDS Res. Hum. Retroviruses 15:931-939(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=1410; 

RA Mueller-Trutwin M.C., Corbet S., Hansen J., Diop O., 

RA Georges-Courbot M.-C., Barre-Sinoussi F., Fomsgaard A.; 

RL Submitted (NOV-1997) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF035214; AAD44007.1; -. 

DR GO; GO:0016021; C:integral to membrane; IEA. 

DR GO; GO:0004872; F:receptor activity; IEA. 

DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA. 

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn. 

DR Pfam; PF00001; 7tm_1; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN. 

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1. 

KW Receptor. 

SQ SEQUENCE 352 AA; 40481 MW; 2578A0F2C07A4A65 CRC64; 


Query Match 46.3%; Score 369; DB 6; Length 352; 

Best Local Similarity 34.3%; Pred. No. 5.3e-33; 

Matches 97; Conservative 8; Mismatches 28; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I : I I I II I I I I I I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGEVGNILVVLILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I | I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I I I 1 I I I II 

Db 119 IILLSIDRYLAIVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I II I - I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I II I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 6 
Q95NC3 

ID Q95NC3 PRELIMINARY; PRT; 352 AA. 

AC Q95NC3; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE C-C chemokine receptor 5. 

GN CCR5. 

OS Miopithecus talapoin (Talapoin) (Cercopithecus talapoin). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Miopithecus. 

OX NCBI_TaxID=36231; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Zhang Y., Ryder O.A., Zhang Y.; 

RT "Sequence comparison of the CCR5 gene in primates and primate 

RT phylogeny."; 

RL Submitted (AUG-1999) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF177886; AAK43369.1; -. 

DR GO; GO:0016021; C:integral to membrane; IEA. 

DR GO; GO:0004872; F:receptor activity; IEA. 

DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA. 

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn. 

DR Pfam; PF00001; 7tm_1; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN. 

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1. 

KW Receptor. 

SQ SEQUENCE 352 AA; 40546 MW; 6464152F3E566AE5 CRC64; 

Query Match 46.0%; Score 367; DB 6; Length 352; 

Best Local Similarity 33.9%; Pred. No. 8.9e-33; 

Matches 96; Conservative 9; Mismatches 28; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 1 MDYQVSSPTYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILVVLILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I : : I : : I 

Db 61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC--RLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I : II I I I I I 

Db 119 IILLTIDRYLAIVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQREGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 


RESULT 7 
O18771 

ID O18771 PRELIMINARY; PRT; 352 AA. 

AC O18771; 

DT 01-JAN-1998 (TrEMBLrel. 05, Created) 

DT 01-JAN-1998 (TrEMBLrel. 05, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CCR5 receptor (Fragment). 

GN CCR5. 

OS Pan troglodytes (Chimpanzee). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Pan. 

OX NCBI_TaxID=9598; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=ChCCR5-141a; 

RA Zhang L., Carruthers C.D., He T., Huang Y., Cao Y., Wang G., Hahn B.,

RA Ho D.D.;

RT "HIV-1 subtypes, co-receptor usage, and CCR5 polymorphism.";

RL AIDS Res. Hum. Retroviruses 0:0-0(1997).

DR EMBL; AF011539; AAB65739.1; -.

DR GO; GO:0016021; C:integral to membrane; IEA.

DR GO; GO:0004872; F:receptor activity; IEA.

DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW Receptor.

FT NON_TER 352 352

SQ SEQUENCE 352 AA; 40466 MW; 3FFFAC7ABAE1D4FB CRC64;

Query Match 46.0%; Score 367; DB 6; Length 352; 

Best Local Similarity 34.3%; Pred. No. 8.9e-33; 

Matches 97; Conservative 8; Mismatches 28; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30

I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

II I I I I I I I I I I I I I : I :: I 

Db 61 LKSMTDIYLLNLAISNLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78 

: II I I I I I I I I I 

Db 119 IILLTIDRYLAIVHAVFALKARTVTFGVVTSVITWWAVFASLPGIIFTRSQKGGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 8 
Q9TSK1 

ID Q9TSK1 PRELIMINARY; PRT; 352 AA. 

AC Q9TSK1; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE G-protein coupled chemokine receptor. 

OS Cercopithecus aethiops (Green monkey) (Grivet).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae;

OC Cercopithecinae; Cercopithecus.

OX NCBI_TaxID=9534;

RN [1] 

RP SEQUENCE FROM N.A. 

RA Holtkamp N., Baier M., Werner A.;

RL Submitted (AUG-1997) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF019379; AAD01639.1; -. 

DR GO; GO:0016021; C:integral to membrane; IEA.

DR GO; GO:0004872; F:receptor activity; IEA.

DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW Receptor.

SQ SEQUENCE 352 AA; 40588 MW; 0F1869D9A6668DBB CRC64;

Query Match 45.9%; Score 366; DB 6; Length 352; 

Best Local Similarity 33.9%; Pred. No. 1.2e-32; 

Matches 96; Conservative 9; Mismatches 28; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30

I I I I I I I I I I I : I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I II I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78

: I I I I : I I II I I I

Db 119 IILLTIDRYLAIVHAVFALKARTVTFGWTSVITWWAVFASLPRIIFTRSQREGLHYTC 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I II I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I II II I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 9 
Q9TV49

ID Q9TV49 PRELIMINARY; PRT; 352 AA.

AC Q9TV49;

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CC chemokine receptor type 5 (C-C chemokine receptor 5).

GN CCR5.

OS Cercocebus galeritus (Agile mangabey).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae;

OC Cercopithecinae; Cercocebus.

OX NCBI_TaxID=9532;

RN [1]

RP SEQUENCE FROM N.A. 

RC STRAIN=4; 

RX MEDLINE=99335215; PubMed=10408730;

RA Muller-Trutwin M.C., Corbet S., Hansen J., Georges-Courbot M.C.,

RA Diop O., Rigoulet J., Barre-Sinoussi F., Fomsgaard A.;

RT "Mutations in CCR5-coding sequences are not associated with SIV 

RT carrier status in African nonhuman primates."; 

RL AIDS Res. Hum. Retroviruses 15:931-939(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=4; 

RA Mueller-Trutwin M.C., Corbet S., Hansen J., Diop O.,

RA Georges-Courbot M.-C., Barre-Sinoussi F., Fomsgaard A.;

RL Submitted (NOV-1997) to the EMBL/GenBank/DDBJ databases.

RN [3] 

RP SEQUENCE FROM N.A. 

RA Zhang Y., Ryder O.A., Zhang Y.;

RT "Sequence comparison of the CCR5 gene in primates and primate

RT phylogeny.";

RL Submitted (AUG-1999) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF035215; AAD44008.1; -. 

DR EMBL; AF177898; AAK43381.1; -. 

DR GO; GO:0016021; C:integral to membrane; IEA.

DR GO; GO:0004872; F:receptor activity; IEA.

DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW Receptor.

SQ SEQUENCE 352 AA; 40479 MW; 5E1504A9B87278B2 CRC64;

Query Match 45.9%; Score 366; DB 6; Length 352; 

Best Local Similarity 33.9%; Pred. No. 1.2e-32; 

Matches 96; Conservative 9; Mismatches 28; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30

I I I I I I I I I I I : I I I I I I I I I I I I I I I I I

Db 1

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64

I I I I I I I I I I I I I I I : I : : I

Db 61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78

: I I I I : I I I I I I I

Db 119 IILLTIDRYIA.IVmVFALKARTVTFGWTSVITWWAVFASLPGIIFTRSQREGLHYTC 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105

I I I I I I I I I I I II I I I I I I I : : I

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 -GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135

I I II II I I I I II I I I I I I : I

Db 239


RESULT 10 
Q9XT13 

ID Q9XT13 PRELIMINARY; PRT; 352 AA. 

AC Q9XT13; 

DT 01-NOV-1999 (TrEMBLrel. 12, Created) 

DT 01-NOV-1999 (TrEMBLrel. 12, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CC chemokine receptor 5. 

GN CCR5.

OS Papio anubis (Olive baboon).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae;

OC Cercopithecinae; Papio.

OX NCBI_TaxID=9555; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Spearman P.W., Mburu D.N., Graham B.S.; 

RT "Differential Utilization of CCR5 Molecules from Three East African 

RT Simian Species by the HIV-1 Envelope Glycoprotein."; 

RL Submitted (APR-1999) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF141640; AAD32685.1; -. 

DR GO; GO:0016021; C:integral to membrane; IEA.

DR GO; GO:0004872; F:receptor activity; IEA.

DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW Receptor.

SQ SEQUENCE 352 AA; 40489 MW; 0B47E337C11E2E1E CRC64;

Query Match 45.9%; Score 366; DB 6; Length 352; 

Best Local Similarity 33.9%; Pred. No. 1.2e-32; 

Matches 96; Conservative 9; Mismatches 28; Indels 150; Gaps 7;

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I II I I I : I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I II I : I : : I 

Db 61 LKSMTDNYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78 

: 1111:1111111 

Db 119 IILLTIDRYLAIVI1AVFALKARTVTFGWTSVITWWAVFASLPGIIFTRSQREGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135

I I I I I I I I I I I I I I I I I I : I

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281


RESULT 11 
Q9TV45 

ID Q9TV45 PRELIMINARY; PRT; 352 AA. 

AC Q9TV45; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CC chemokine receptor type 5. 

GN CCR5.

OS Cercopithecus nictitans (white-nosed guenon).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae;

OC Cercopithecinae; Cercopithecus.

OX NCBI_TaxID=36228;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=7;

RX MEDLINE=99335215; PubMed=10408730;

RA Muller-Trutwin M.C., Corbet S., Hansen J., Georges-Courbot M.C.,

RA Diop O., Rigoulet J., Barre-Sinoussi F., Fomsgaard A.;

RT "Mutations in CCR5-coding sequences are not associated with SIV 

RT carrier status in African nonhuman primates."; 

RL AIDS Res. Hum. Retroviruses 15:931-939(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=7;

RA Mueller-Trutwin M.C., Corbet S., Hansen J., Diop O.,

RA Georges-Courbot M.-C., Barre-Sinoussi F., Fomsgaard A.;

RL Submitted (NOV-1997) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF035219; AAD44012.1; -. 

DR GO; GO:0016021; C:integral to membrane; IEA.

DR GO; GO:0004872; F:receptor activity; IEA.

DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW Receptor.

SQ SEQUENCE 352 AA; 40658 MW; BC665891ED5661F8 CRC64;

Query Match 45.9%; Score 366; DB 6; Length 352; 

Best Local Similarity 33.9%; Pred. No. 1.2e-32; 

Matches 96; Conservative 9; Mismatches 28; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I II I I I : I I I I I I I I I II I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPLLYSLVFIFGFVGNILWLILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78 

: 1111:1111111 

Db 119 IILLTIDRYIAIVmVFALKARTV^FGVA^SVITWVVAVFASLPRIIFTRSQREGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105

I I I I II II I I I I I I I I I I I I : : I

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVMLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I II I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 12 
Q95NE8 


ID Q95NE8 PRELIMINARY; PRT; 352 AA. 

AC Q95NE8; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created)

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update)

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CC chemokine receptor type 5. 

GN CCR5.

OS Cercopithecus tantalus.

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae;

OC Cercopithecinae; Cercopithecus.

OX NCBI_TaxID=60712; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99335215; PubMed=10408730;

RA Muller-Trutwin M.C., Corbet S., Hansen J., Georges-Courbot M.C., 

RA Diop O., Rigoulet J., Barre-Sinoussi F., Fomsgaard A.; 

RT "Mutations in CCR5-coding sequences are not associated with SIV 

RT carrier status in African nonhuman primates.";

RL AIDS Res. Hum. Retroviruses 15:931-939(1999).

DR EMBL; AF081577; AAD45495.1; -.

DR GO; GO:0016021; C:integral to membrane; IEA.

DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

SQ SEQUENCE 352 AA; 40576 MW; 0F1869D9B7968DBB CRC64;


Query Match 45.9%; Score 366; DB 6; Length 352; 

Best Local Similarity 33.9%; Pred. No. 1.2e-32; 

Matches 96; Conservative 9; Mismatches 28; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I II I : I I I I I I I I II I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78

: I I I I : I I I I I I I

Db 119 IILLTIDRYLAIVHAVFALKARTVTFGWTSVITWWAVFASLPRIIFTRSQREGLHYTC 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105

I I I I I I I I I II I I I I I I I I I :: I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I II I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281


RESULT 13 
Q9XT14 

ID Q9XT14 PRELIMINARY; PRT; 352 AA. 

AC Q9XT14; 

DT 01-NOV-1999 (TrEMBLrel. 12, Created) 

DT 01-NOV-1999 (TrEMBLrel. 12, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CC chemokine receptor 5. 

GN CCR5.

OS Colobus guereza (Black-and-white colobus monkey).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; Colobinae; 

OC Colobus. 

OX NCBI_TaxID=33548; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Spearman P.W., Mburu D.N., Graham B.S.; 

RT "Differential Utilization of CCR5 Molecules from Three East African 

RT Simian Species by the HIV-1 Envelope Glycoprotein."; 

RL Submitted (APR-1999) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF141639; AAD32684.1; -. 

DR GO; GO:0016021; C:integral to membrane; IEA.

DR GO; GO:0004872; F:receptor activity; IEA.

DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW Receptor.

SQ SEQUENCE 352 AA; 40550 MW; 9B078EF04D34DB36 CRC64;

Query Match 45.8%; Score 365; DB 6; Length 352;

Best Local Similarity 33.6%; Pred. No. 1.5e-32; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30

I I I I I I I I I I I : I I I I I III I : I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKVNVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I : I I I I I I I 

Db 119 IILLTIDRYI^IVliAVFALKARTATFGVVrSVITWWAVFASLPGIIFTRSQREGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105

I II I I I I II I I I I I I I I I I I : : I

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLWIVICYSGILKTLLRCRNEKKRHRAVRLIF 238


Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I II I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 14 
Q95NC6 

ID Q95NC6 PRELIMINARY; PRT; 352 AA. 

AC Q95NC6; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created)

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update)

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE C-C chemokine receptor 5. 

GN CCR5.

OS Trachypithecus johnii (hooded leaf monkey).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; Colobinae;

OC Trachypithecus.

OX NCBI_TaxID=66063; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Zhang Y., Ryder O.A., Zhang Y.; 

RT "Sequence comparison of the CCR5 gene in primates and primate 

RT phylogeny.";

RL Submitted (AUG-1999) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF177883; AAK43366.1; -. 

DR GO; GO:0016021; C:integral to membrane; IEA.

DR GO; GO:0004872; F:receptor activity; IEA.

DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW Receptor.

SQ SEQUENCE 352 AA; 40462 MW; 52824E0322559F7F CRC64;

Query Match 45.8%; Score 365; DB 6; Length 352; 

Best Local Similarity 33.6%; Pred. No. 1.5e-32; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I 1 I I I I I I I : I I II I I I I I : I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKVNVKQIAARLLPPLYSLVFIFGFVGNILWXILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78

: I I I I : I I I I I I I

Db 119 IILLTIDRYLAIVI1AVFALKARTWFGVWSVITWWAVFASLPGIIFTRSQREGLHYTC 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105

I I I I I I I I I I I I I I I I I I I I : : I

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRSEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I II I I I I I I II I I I : I 


Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 15 
Q95NC8 

ID Q95NC8 PRELIMINARY; PRT; 352 AA. 

AC Q95NC8; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created)

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE C-C chemokine receptor 5. 

GN CCR5.

OS Colobus polykomos.

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; Colobinae;

OC Colobus.

OX NCBI_TaxID=9572;

RN [1] 

RP SEQUENCE FROM N.A. 

RA Zhang Y., Ryder O.A., Zhang Y.;

RT "Sequence comparison of the CCR5 gene in primates and primate

RT phylogeny.";

RL Submitted (AUG-1999) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF177881; AAK43364.1; -. 

DR GO; GO:0016021; C:integral to membrane; IEA.

DR GO; GO:0004872; F:receptor activity; IEA.

DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW Receptor.

SQ SEQUENCE 352 AA; 40578 MW; 4366F149C3B4938F CRC64;

Query Match 45.8%; Score 365; DB 6; Length 352; 

Best Local Similarity 33.6%; Pred. No. 1.5e-32; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30

I I I I I I I I I I I : I I I I I I I I I : I I I I I I I

Db 1

Qy 31 -YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64

I I I I I I I I I I I I I I I : I : : I

Db 61

Qy 65 LCTRSQKEGLHYTC 78

: I I I I : I I I I I I I

Db 119 IILLTIDRYLAIVHAVFALKARTVTFGWTSVITWWAVFASLPGIIFTRSQREGLHYTC 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105

I I I I I I I II I I I I I I I I I I I : : I

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 -GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281
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ALIGNMENTS 


RESULT 1 
CKR5_HUMAN 

ID CKR5_HUMAN STANDARD; PRT; 352 AA. 

AC P51681; O14692; O14693; O14695; O14696; O14697; O14698; O14699;

AC O14700; O14701; O14702; O14703; O14704; O14705; O14706; O14707;

AC O14708; O15538; Q9UPA4;

DT 01-OCT-1996 (Rel. 34, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) 

DE (HIV-1 fusion co-receptor) (CHEMR13) (CD195 antigen).

GN CCR5 OR CMKBR5.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96241590; PubMed=8639485;

RA Samson M., Labbe O., Mollereau C., Vassart G., Parmentier M.;

RT "Molecular cloning and functional expression of a new human 

RT CC-chemokine receptor gene."; 

RL Biochemistry 35:3362-3367(1996). 

RN [2] 


RP SEQUENCE FROM N.A. 

RX MEDLINE=96291862; PubMed=8663314;

RA Raport C.J., Gosling J., Schweichart V.L., Gray P.W., Charo I.F.; 

RT "Molecular cloning and functional characterization of a novel human 

RT CC chemokine receptor (CCR5) for RANTES, MIP-1beta, and MIP-1alpha.";

RL J. Biol. Chem. 271:17161-17166(1996). 

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96295970; PubMed=8699119; 

RA Combadiere C., Ahuja S.K., Tiffany H.L., Murphy P.M.;

RT "Cloning and functional expression of CC CKR5, a human monocyte CC

RT chemokine receptor selective for MIP-1(alpha), MIP-1(beta), and

RT RANTES.";

RL J. Leukoc. Biol. 60:147-152(1996). 

RN [4] 

RP SEQUENCE FROM N.A. 

RA McCombie W.R., Wilson R., Chen E., Gibbs R., Zuo L., Johnson D.,

RA Nhan M., Parnell L., Dedhia N., Ansari A., Mardis E., Schutz K.,

RA Gnoj L., la Bastide M., Kaplan N., Greco T., Touchman J.,

RA Muzny D., Chen C.N., Evans C., Fitzgerald M., See L.H., Tang M.,

RA Porcel B.M., Dragan Y., Giacalone J., Pae A., Powell E.,

RA Solinsky K.A., Desilva U., Diaz-Perez S., Zhou X., Yu Y.,

RA Watanabe M., Doggett N., Garcia D., Sagripanti J.L.;

RL Submitted (MAY-1997) to the EMBL/GenBank/DDBJ databases.

RN [5] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98001387; PubMed=9343222;

RA Kuhmann S.E., Platt E.J., Kozak S.L., Kabat D.;

RT "Polymorphisms in the CCR5 genes of African green monkeys and mice 

RT implicate specific amino acids in infections by simian and human 

RT immunodeficiency viruses."; 

RL J. Virol. 71:8642-8656(1997). 

RN [6] 

RP SEQUENCE FROM N.A., AND POLYMORPHISMS.

RX MEDLINE=98022612; PubMed=9359654;

RA Zhang L., Carruthers C.D., He T., Huang Y., Cao Y., Wang G., Hahn B.,

RA Ho D.D.;

RT "HIV type 1 subtypes, coreceptor usage, and CCR5 polymorphism."; 

RL AIDS Res. Hum. Retroviruses 13:1357-1366(1997). 

RN [7] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98049523; PubMed=9388201;

RA Mummidi S., Ahuja S.S., McDaniel B.L., Ahuja S.K.; 

RT "The human CC chemokine receptor 5 (CCR5) gene. Multiple transcripts 

RT with 5'-end heterogeneity, dual promoter usage, and evidence for

RT polymorphisms within the regulatory regions and noncoding exons."; 

RL J. Biol. Chem. 272:30662-30671(1997). 

RN [8] 

RP SEQUENCE FROM N.A., AND VARIANT ARG-178.

RA Magierowska M., Barre-Sinoussi F., Issafras H., Theodorou I.,

RA Debre P.;

RL Submitted (MAR-1998) to the EMBL/GenBank/DDBJ databases.

RN [9] 

RP SEQUENCE FROM N.A. 

RA Kopatz S.A., Aronstam R.S., Sharma S.V.;

RT "cDNA clones of human proteins involved in signal transduction 

RT sequenced by the Guthrie cDNA resource center (www.cdna.org)."; 


RL Submitted (JAN-2003) to the EMBL/GenBank/DDBJ databases.

RN [10] 

RP CHARACTERIZATION OF ITS HIV-1 RECEPTOR FUNCTION. 

RX MEDLINE=96260017; PubMed=8649511;

RA Deng H., Liu R., Ellmeier W., Choe S., Unutmaz D., Burkhart M.,

RA di Marzio P., Marmon S., Sutton R.E., Hill C.M., Davis C.B.,

RA Peiper S.C., Schall T.J., Littman D.R., Landau N.R.; 

RT "Identification of a major co-receptor for primary isolates of 

RT HIV-1."; 

RL Nature 381:661-666(1996). 

RN [11] 

RP CHARACTERIZATION OF ITS HIV-1 RECEPTOR FUNCTION.

RX MEDLINE=96260018; PubMed=8649512;

RA Dragic T., Litwin V., Allaway G.P., Martin S.R., Huang Y.,

RA Nagashima K.A., Cayanan C., Maddon P.J., Koup R.A., Moore J.P.,

RA Paxton W.A.;

RT "HIV-1 entry into CD4+ cells is mediated by the chemokine receptor

RT CC-CKR-5.";

RL Nature 381:667-673(1996).

RN [12] 

RP SULFATION. 

RX MEDLINE=99189752; PubMed=10089882;

RA Farzan M., Mirzabekov T., Kolchinsky P., Wyatt R., Cayabyab M.,

RA Gerard N.P., Gerard C., Sodroski J., Choe H.;

RT "Tyrosine sulfation of the amino terminus of CCR5 facilitates HIV-1

RT entry.";

RL Cell 96:667-676(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha,

CC MIP-1-beta and RANTES and subsequently transduces a signal by

CC increasing the intracellular calcium ions level. May play a role

CC in the control of granulocytic lineage proliferation or

CC differentiation. Acts as co-receptor with CD4 for primary non-

CC syncytium-inducing strains (NSI) (macrophage-tropic) of HIV-1

CC virus. It promotes Env-mediated fusion of the virus.

CC -!- SUBCELLULAR LOCATION: Integral membrane protein.

CC -!- TISSUE SPECIFICITY: Found in promyelocytic cells.

CC -!- PTM: Sulfation contributes to the efficiency of HIV-1 entry.

CC -!- PTM: Modified by O-linked glycosylation, but not by N-linked

CC glycosylation.

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors.

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).


CC 



DR EMBL; X91492; CAA62796.1; -.

DR EMBL; U54994; AAC50598.1; -.

DR EMBL; U57840; AAB17071.1; -.

DR EMBL; U95626; AAB57793.1; -.

DR EMBL; U83326; AAC51797.1; -.

DR EMBL; AF011500; AAB65700.1; -.

DR EMBL; AF011501; AAB65701.1; -.

DR EMBL; AF011502; AAB65702.1; -.


DR EMBL; AF011503; AAB65703.1; -.

DR EMBL; AF011505; AAB65705.1; -.

DR EMBL; AF011506; AAB65706.1; -.

DR EMBL; AF011507; AAB65707.1; -.

DR EMBL; AF011508; AAB65708.1; -.

DR EMBL; AF011509; AAB65709.1; -.

DR EMBL; AF011510; AAB65710.1; -.

DR EMBL; AF011511; AAB65711.1; -.

DR EMBL; AF011512; AAB65712.1; -.

DR EMBL; AF011513; AAB65713.1; -.

DR EMBL; AF011514; AAB65714.1; -.

DR EMBL; AF011515; AAB65715.1; -.

DR EMBL; AF011516; AAB65716.1; -.

DR EMBL; AF011517; AAB65717.1; -.

DR EMBL; AF011518; AAB65718.1; -.

DR EMBL; AF011519; AAB65719.1; -.

DR EMBL; AF011520; AAB65720.1; -.

DR EMBL; AF011521; AAB65721.1; -.

DR EMBL; AF011522; AAB65722.1; -.

DR EMBL; AF011523; AAB65723.1; -.

DR EMBL; AF011524; AAB65724.1; -.

DR EMBL; AF011525; AAB65725.1; -.

DR EMBL; AF011526; AAB65726.1; -.

DR EMBL; AF011527; AAB65727.1; -.

DR EMBL; AF011528; AAB65728.1; -.

DR EMBL; AF011529; AAB65729.1; -.

DR EMBL; AF011530; AAB65730.1; -.

DR EMBL; AF011531; AAB65731.1; -.

DR EMBL; AF011532; AAB65732.1; -.

DR EMBL; AF011533; AAB65733.1; -.

DR EMBL; AF011534; AAB65734.1; -.

DR EMBL; AF011535; AAB65735.1; -.

DR EMBL; AF011536; AAB65736.1; -.

DR EMBL; AF011537; AAB65737.1; -.

DR EMBL; AF031237; AAB94735.1; -.

DR EMBL; AF052539; AAD18131.1; -.

DR EMBL; AY221093; AAO65971.1; -.

DR Genew; HGNC:1606; CCR5.

DR MIM; 601373; -. 

DR GO; GO:0005768; C:endosome; TAS.

DR GO; GO:0005887; C:integral to plasma membrane; TAS.

DR GO; GO:0016493; F:C-C chemokine receptor activity; NAS.

DR GO; GO:0015026; F:coreceptor activity; TAS.

DR GO; GO:0007267; P:cell-cell signaling; TAS.

DR GO; GO:0006968; P:cellular defense response; TAS.

DR GO; GO:0006935; P:chemotaxis; TAS.

DR GO; GO:0007204; P:cytosolic calcium ion concentration elevation; TAS.

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; TAS.

DR GO; GO:0006954; P:inflammatory response; TAS.

DR GO; GO:0007125; P:invasive growth; TAS.

DR GO; GO:0007203; P:phosphatidylinositol-4,5-bisphosphate hydro...; TAS.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Transmembrane; Glycoprotein; Sulfation;

KW Polymorphism.



FT 
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FT 
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FT 
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CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

69 

89 
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FT 
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9.0 

102 

EXTRACELLULAR (POTENTIAL) 

FT 
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3 (POTENTIAL) . 


Query Match 47.6%; Score 379; DB 1; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.8e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 
Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 

Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

| | 1111111111111:1: : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78

: I I I I I I I I I I I I

Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I ! I I I I I II I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 2 
CKR5_PANTR 

ID CKR5_PANTR STANDARD; PRT; 352 AA. 

AC P56440; O02778;

DT 15-JUL-1998 (Rel. 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5).

GN CCR5 OR CMKBR5.

OS Pan troglodytes (Chimpanzee).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Pan. 

OX NCBI_TaxID=9598; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=97268687; PubMed=9108095;

RA Edinger A.L., Amedee A., Miller K., Doranz B.J., Endres M.,

RA Sharron M., Samson M., Lu Z.H., Clements J.E., Murphey-Corb M.,

RA Peiper S.C., Parmentier M., Broder C.C., Doms R.W.;

RT "Differential utilization of CCR5 by macrophage and T cell tropic

RT simian immunodeficiency virus strains.";

RL Proc. Natl. Acad. Sci. U.S.A. 94:4005-4010(1997). 

RN [2] 

RP SEQUENCE FROM N.A. 


RA Zimmerman P.A., Buckler-White A., Alkhatib G.;

RL Submitted (MAR-1997) to the EMBL/GenBank/DDBJ databases.

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98022612; PubMed=9359654 ; 

RA Zhang L., Carruthers C.D., He T., Huang Y., Cao Y., Wang G., Hahn B., 

RA Ho D.D.; 

RT "HIV type 1 subtypes, coreceptor usage, and CCR5 polymorphism."; 

RL AIDS Res. Hum. Retroviruses 13:1357-1366(1997). 

RN [4] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=97426118; PubMed=9282822 ; 

RA Zacharova V., Zachar V., Goustin A.S.; 

RT "Sequence of chemokine receptor gene CCR5 in chimpanzees, a natural 

RT HIV type 1 host."; 

RL AIDS Res. Hum. Retroviruses 13:1159-1161(1997). 

RN [5] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98090115; PubMed=9430250 ; 

RA Pretet J.-L., Zerbib A., Girard M. , Guillet J.-G., Butor C. ; 

RT "Chimpanzee CXCR4 and CCR5 act as coreceptors for HIV type 1."; 

RL AIDS Res. Hum. Retroviruses 13:1583-1587(1997). 

RN [6] 

RP SEQUENCE FROM N.A. 

RA Zhang Y., Ryder O.A., Zhang Y.; 

RT "Sequence comparison of the CCR5 gene in primates and primate 

RT phylogeny."; 

RL Submitted (AUG-1999) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; AF005663; AAB62557.1; -. 

DR EMBL; U94329; AAB58446.1; 

DR EMBL; AF011542; AAB65742.1; -. 

DR EMBL; U97666; AAC51670.1; -. 

DR EMBL; AF011540; AAB65740.1; -. 

DR EMBL; U89797; AAC03717.1; -. 

DR EMBL; AF177894; AAK43377.1; -. 

DR InterPro; IPR000276; GPCR_Rhodpsn. 

DR Pfam; PF00001; 7tm_1; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN. 

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1. 

KW G-protein coupled receptor; Transmembrane; Glycoprotein; Sulfation. 
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FT 

DOMAIN 
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FT 
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FT 
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N-LINKED (GLCNAC. . . ) ( PO 

FT 

CONFLICT 

123 
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T -> S (IN REF. 1) . 

SQ SEQUENCE 352 AA; 40539 MW; 4A33E698B80FE34C CRC64; 


Query Match 46.9%; Score 374; DB 1; Length 352; 

Best Local Similarity 34.6%; Pred. No. 5.6e-29; 

Matches 98; Conservative 8; Mismatches 27; Indels 150; Gaps 7; 


Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I 

Db 1 MDYQVSSPIYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I I I I I I I I I 

Db 119 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I 

Db 179 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 

RESULT 3 
CKR5_HYLSY 

ID CKR5_HYLSY STANDARD; PRT; 352 AA. 

AC Q95NC5; 

DT 15-MAR-2004 (Rel. 43, Created) 

DT 15-MAR-2004 (Rel. 43, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5). 


GN CCR5 OR CMKBR5 . 

OS Hylobates syndactylus (Siamang) (Symphalangus syndactylus). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hylobatidae; Hylobates . 

OX NCBI_TaxID=9590; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99416438; PubMed=10486970; 

RA Zhang Y.-W., Ryder O.A., Zhang Y.-P.; 

RT "Sequence evolution of the CCR5 chemokine receptor gene in primates."; 

RL Mol. Biol. Evol. 16:1145-1154(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 
CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 


CC 





DR 
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FT 
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SQ 

SEQUENCE 352 AA; 40508 MW; F4F64B3AD5AF658A CRC64; 


Sulfation. 


Query Match 46.3%; Score 369; DB 1; Length 352; 

Best Local Similarity 34.3%; Pred. No. 1.7e-28; 


Matches 97; Conservative 8; Mismatches 28; Indels 150; Gaps 7; 


Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I : I I I I I I I I I I I I I I I I I 

Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAIVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 4 
CKR5_PONPY 

ID CKR5_PONPY STANDARD; PRT; 352 AA. 

AC O97881; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 

GN CCR5 OR CMKBR5 . 

OS Pongo pygmaeus (Orangutan) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Pongo. 

OX NCBI_TaxID=9600; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99416438; PubMed=10486970; 

RA Zhang Y.-W., Ryder O.A., Zhang Y.-P.; 

RT "Sequence evolution of the CCR5 chemokine receptor gene in primates."; 

RL Mol. Biol. Evol. 16:1145-1154(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 
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69 

89 

2 (POTENTIAL). 

FT DOMAIN 90 102 EXTRACELLULAR (POTENTIAL). 
FT TRANSMEM 103 124 3 (POTENTIAL). 
FT DOMAIN 125 141 CYTOPLASMIC (POTENTIAL). 
FT TRANSMEM 142 166 4 (POTENTIAL). 
FT DOMAIN 167 198 EXTRACELLULAR (POTENTIAL). 
FT TRANSMEM 199 218 5 (POTENTIAL). 
FT DOMAIN 219 235 CYTOPLASMIC (POTENTIAL). 
FT TRANSMEM 236 260 6 (POTENTIAL). 
FT DOMAIN 261 277 EXTRACELLULAR (POTENTIAL). 
FT TRANSMEM 278 301 7 (POTENTIAL). 
FT DOMAIN 302 352 CYTOPLASMIC (POTENTIAL). 
FT DISULFID 101 178 BY SIMILARITY. 
FT MOD_RES 3 3 SULFATION (BY SIMILARITY). 
FT MOD_RES 10 10 SULFATION (BY SIMILARITY). 
FT MOD_RES 14 14 SULFATION (BY SIMILARITY). 
FT MOD_RES 15 15 SULFATION (BY SIMILARITY). 
SQ SEQUENCE 352 AA; 40527 MW; F4E2F47135AF658A CRC64; 


Query Match 46.3%; Score 369; DB 1; Length 352; 

Best Local Similarity 34.3%; Pred. No. 1.7e-28; 

Matches 97; Conservative 8; Mismatches 28; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I : I I I II I I I I I I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAIVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I II I II I I I I II I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 5 
CKR5_PAPHA 


ID CKR5_PAPHA STANDARD; PRT; 352 AA. 

AC P56441; 

DT 15-JUL-1998 (Rel. 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5). 

GN CCR5 OR CMKBR5. 

OS Papio hamadryas (Hamadryas baboon), and 

OS Papio anubis (Olive baboon) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Papio. 

OX NCBI_TaxID=9557, 9555; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC SPECIES=P. hamadryas; 

RX MEDLINE=97268687; PubMed=9108095; 

RA Edinger A.L., Amedee A., Miller K., Doranz B.J., Endres M., 

RA Sharron M., Samson M., Lu Z.H., Clements J.E., Murphey-Corb M., 

RA Peiper S.C., Parmentier M., Broder C.C., Doms R.W.; 

RT "Differential utilization of CCR5 by macrophage and T cell tropic 

RT simian immunodeficiency virus strains."; 

RL Proc. Natl. Acad. Sci. U.S.A. 94:4005-4010(1997). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC SPECIES=P. hamadryas; 

RX MEDLINE=99210133; PubMed=10195758; 

RA Saksena N.K., Wang B., Novembre F.J., Bolton W., Smit T.K., Lai R.B.; 

RT "Species-specific changes in the CCR5 gene from African and Asian 

RT nonhuman primates."; 

RL AIDS Res. Hum. Retroviruses 15:479-483(1999). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC SPECIES=P. anubis; 

RA Benton P.A., Timanus D.K., Shearer M.H., Lee D.R., Kennedy R.C.; 

RL Submitted (NOV-1997) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; AF005658; AAB62552.1; -. 

DR EMBL; AF105287; AAD20556.1; -. 

DR EMBL; AF105288; AAD20557.1; 

DR EMBL; AF105289; AAD20558.1; -. 

DR EMBL; AF105290; AAD20559.1; 


DR EMBL; AF023452; AAC63830.1; 
DR InterPro; IPR000276; GPCR_Rhodpsn. 
DR Pfam; PF00001; 7tm_1; 1. 
DR PRINTS; PR00237; GPCRRHODOPSN. 
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1. 
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1. 
KW G-protein coupled receptor; Transmembrane; Glycoprotein; Sulfation. 

FT DOMAIN 1 30 EXTRACELLULAR (POTENTIAL). 
FT TRANSMEM 31 58 1 (POTENTIAL). 
FT DOMAIN 59 68 CYTOPLASMIC (POTENTIAL). 
FT TRANSMEM 69 89 2 (POTENTIAL). 
FT DOMAIN 90 102 EXTRACELLULAR (POTENTIAL). 
FT TRANSMEM 103 124 3 (POTENTIAL). 
FT DOMAIN 125 141 CYTOPLASMIC (POTENTIAL). 
FT TRANSMEM 142 166 4 (POTENTIAL). 
FT DOMAIN 167 198 EXTRACELLULAR (POTENTIAL). 
FT TRANSMEM 199 218 5 (POTENTIAL). 
FT DOMAIN 219 235 CYTOPLASMIC (POTENTIAL). 
FT TRANSMEM 236 260 6 (POTENTIAL). 
FT DOMAIN 261 277 EXTRACELLULAR (POTENTIAL). 
FT TRANSMEM 278 301 7 (POTENTIAL). 
FT DOMAIN 302 352 CYTOPLASMIC (POTENTIAL). 
FT DISULFID 101 178 BY SIMILARITY. 
FT MOD_RES 3 3 SULFATION (BY SIMILARITY). 
FT MOD_RES 10 10 SULFATION (BY SIMILARITY). 
FT MOD_RES 14 14 SULFATION (BY SIMILARITY). 
FT MOD_RES 15 15 SULFATION (BY SIMILARITY). 
FT CARBOHYD 268 268 N-LINKED (GLCNAC...) (POTENTIAL). 
SQ SEQUENCE 352 AA; 40489 MW; 5E1504A9BA1FE8B2 CRC64; 


Query Match 45.9%; Score 366; DB 1; Length 352; 

Best Local Similarity 33.9%; Pred. No. 3.3e-28; 

Matches 96; Conservative 9; Mismatches 28; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I II : I I I I I 1 I I I I I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILVVLILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

Db 119 IILLTIDRYLAIVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQREGLHYTC 178 

Qy 

Db 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 6 
CKR5_PYGBI 


ID CKR5_PYGBI STANDARD; PRT; 352 AA. 

AC O97880; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 

GN CCR5 OR CMKBR5. 

OS Pygathrix bieti (Black snub-nosed monkey) (Rhinopithecus bieti) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; Colobinae; 

OC Pygathrix. 

OX NCBI_TaxID=61621; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99416438; PubMed=10486970 ; 

RA Zhang Y.-W., Ryder O.A., Zhang Y.-P.; 

RT "Sequence evolution of the CCR5 chemokine receptor gene in primates."; 

RL Mol. Biol. Evol. 16:1145-1154(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 







DR EMBL; AF075445; AAD19857.1; 
DR InterPro; IPR000276; GPCR_Rhodpsn. 
DR Pfam; PF00001; 7tm_1; 1. 
DR PRINTS; PR00237; GPCRRHODOPSN. 
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1. 
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1. 
KW G-protein coupled receptor; Transmembrane; Glycoprotein 

FT 

DOMAIN 
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EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 
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58 

1 (POTENTIAL) . 

FT DOMAIN 59 68 CYTOPLASMIC (POTENTIAL). 
FT TRANSMEM 69 89 2 (POTENTIAL). 

FT 

DOMAIN 

90 
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EXTRACELLULAR (POTENTIAL) . 

FT TRANSMEM 103 124 3 (POTENTIAL). 
FT DOMAIN 125 141 CYTOPLASMIC (POTENTIAL). 
FT TRANSMEM 142 166 4 (POTENTIAL). 
FT DOMAIN 167 198 EXTRACELLULAR (POTENTIAL). 
FT TRANSMEM 199 218 5 (POTENTIAL). 
FT DOMAIN 219 235 CYTOPLASMIC (POTENTIAL). 
FT TRANSMEM 236 260 6 (POTENTIAL). 
FT DOMAIN 261 277 EXTRACELLULAR (POTENTIAL). 
FT TRANSMEM 278 301 7 (POTENTIAL). 
FT DOMAIN 302 352 CYTOPLASMIC (POTENTIAL). 
FT DISULFID 101 178 BY SIMILARITY. 
FT MOD_RES 3 3 SULFATION (BY SIMILARITY). 
FT MOD_RES 10 10 SULFATION (BY SIMILARITY). 
FT MOD_RES 14 14 SULFATION (BY SIMILARITY). 
FT MOD_RES 15 15 SULFATION (BY SIMILARITY). 
SQ SEQUENCE 352 AA; 40585 MW; 4366F142730F938F CRC64; 


Query Match 45.8%; Score 365; DB 1; Length 352; 

Best Local Similarity 33.6%; Pred. No. 4.1e-28; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I II I I I I I I : I I I I I I I I I : I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKVNVKQIAARLLPPLYSLVFIFGFVGNILVVLILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

II I I I I I I M I I I I I : I I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I : I I I I I I I 

Db 119 IILLTIDRYLAIVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQREGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I II I I I I I I I I I I I I I I I I I 
Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I II I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 7 
CKR5_PYGNE 

ID CKR5_PYGNE STANDARD; PRT; 352 AA. 

AC O97882; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 

GN CCR5 OR CMKBR5 . 

OS Pygathrix nemaeus (Douc langur). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; Colobinae; 

OC Pygathrix. 

OX NCBI_TaxID=54133; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99416438; PubMed=10486970; 

RA Zhang Y.-W., Ryder O.A., Zhang Y.-P.; 

RT "Sequence evolution of the CCR5 chemokine receptor gene in primates."; 

RL Mol. Biol. Evol. 16:1145-1154(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 
CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 


CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 
CC 
CC This SWISS-PROT entry is copyright. It is produced through a collaboration 
CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 
CC the European Bioinformatics Institute. There are no restrictions on its 
CC use by non-profit institutions as long as its content is in no way 
CC modified and this statement is not removed. Usage by and for commercial 
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
CC or send an email to license@isb-sib.ch). 
CC 
DR EMBL; AF075448; AAD19860.1; 
DR InterPro; IPR000276; GPCR_Rhodpsn. 
DR Pfam; PF00001; 7tm_1; 1. 
DR PRINTS; PR00237; GPCRRHODOPSN. 
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1. 
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1. 
KW G-protein coupled receptor; Transmembrane; Glycoprotein; Sulfation. 

FT DOMAIN 1 30 EXTRACELLULAR (POTENTIAL). 
FT TRANSMEM 31 58 1 (POTENTIAL). 
FT DOMAIN 59 68 CYTOPLASMIC (POTENTIAL). 
FT TRANSMEM 69 89 2 (POTENTIAL). 
FT DOMAIN 90 102 EXTRACELLULAR (POTENTIAL). 
FT TRANSMEM 103 124 3 (POTENTIAL). 
FT DOMAIN 125 141 CYTOPLASMIC (POTENTIAL). 
FT TRANSMEM 142 166 4 (POTENTIAL). 
FT DOMAIN 167 198 EXTRACELLULAR (POTENTIAL). 
FT TRANSMEM 199 218 5 (POTENTIAL). 
FT DOMAIN 219 235 CYTOPLASMIC (POTENTIAL). 
FT TRANSMEM 236 260 6 (POTENTIAL). 
FT DOMAIN 261 277 EXTRACELLULAR (POTENTIAL). 
FT TRANSMEM 278 301 7 (POTENTIAL). 
FT DOMAIN 302 352 CYTOPLASMIC (POTENTIAL). 
FT DISULFID 101 178 BY SIMILARITY. 
FT MOD_RES 3 3 SULFATION (BY SIMILARITY). 
FT MOD_RES 10 10 SULFATION (BY SIMILARITY). 
FT MOD_RES 14 14 SULFATION (BY SIMILARITY). 
FT MOD_RES 15 15 SULFATION (BY SIMILARITY). 
SQ SEQUENCE 352 AA; 40532 MW; FE4F9D98D3B3E861 CRC64; 


Query Match 45.8%; Score 365; DB 1; Length 352; 

Best Local Similarity 33.6%; Pred. No. 4.1e-28; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 7; 


Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I : I I I I I I I I I : I I I I I I I 

Db 1 MDYQVSSPTYDIDYYTSEPCQKVNVKQIAARLLPPLYSLVFIFGFVGNILVVLILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 

Db 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLIMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 


Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I II I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 

RESULT 8 
CKR5_TRAFR 

ID CKR5_TRAFR STANDARD; PRT; 352 AA. 

AC O97878; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 

GN CCR5 OR CMKBR5. 

OS Trachypithecus francoisi (Francois' langur) (Indochinese langur). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; Colobinae; 

OC Trachypithecus. 

OX NCBI_TaxID=54180; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99416438; PubMed=10486970; 

RA Zhang Y.-W., Ryder O.A., Zhang Y.-P.; 

RT "Sequence evolution of the CCR5 chemokine receptor gene in primates."; 

RL Mol. Biol. Evol. 16:1145-1154(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; AF075442; AAD19854.1; -. 

DR InterPro; IPR000276; GPCR_Rhodpsn. 
DR Pfam; PF00001; 7tm_1; 1. 
DR PRINTS; PR00237; GPCRRHODOPSN. 
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1. 
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1. 
KW G-protein coupled receptor; Transmembrane; Glycoprotein 

FT DOMAIN 1 30 EXTRACELLULAR (POTENTIAL). 
FT TRANSMEM 31 58 1 (POTENTIAL). 
FT DOMAIN 59 68 CYTOPLASMIC (POTENTIAL). 
FT TRANSMEM 69 89 2 (POTENTIAL). 

FT 

DOMAIN 

90 
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EXTRACELLULAR (POTENTIAL) 

FT 

TRANSMEM 

103 
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3 (POTENTIAL). 

FT 

DOMAIN 
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141 

CYTOPLASMIC (POTENTIAL) . 


FT TRANSMEM 142 166 4 (POTENTIAL). 

FT 

DOMAIN 

167 
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rVPnurTT T ITT 7\ Ti / Q ^ T 1 XT' > T rn t 71 t \ 

EXTRACELLULAR (POTENTIAL). 

FT 
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Z lb 

5 (POTENTIAL). 

FT 

DOMAIN 
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FT 

TRANSMEM 
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6 ( POTENTIAL) . 

FT 

DOMAIN 
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EXTRACELLULAR (POTENTIAL) 

FT 

TRANSMEM 

278 
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/ (POTENTIAL) . 

FT 

DOMAIN 
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t~* \7Tn /~\ Tt T 7V CKVTT / n/^TI T7i>.Tm T7\T \ 

CYTOPLASMIC (POTENTIAL). 

FT DISULFID 101 178 BY SIMILARITY. 
FT MOD_RES 3 3 SULFATION (BY SIMILARITY). 
FT MOD_RES 10 10 SULFATION (BY SIMILARITY). 
FT MOD_RES 14 14 SULFATION (BY SIMILARITY). 
FT MOD_RES 15 15 SULFATION (BY SIMILARITY). 
SQ SEQUENCE 352 AA; 40496 MW; 4366F148C255938F CRC64; 


Query Match 45.8%; Score 365; DB 1; Length 352; 

Best Local Similarity 33.6%; Pred. No. 4.1e-28; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I : I I I I I I I I I : I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKVNVKQIAARLLPPLYSLVFIFGFVGNILVVLILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I : I I I I I I I 

Db 119 IILLTIDRYLAIVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQREGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I II I I I I I I I I II II I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 9 
CKR5_TRAPH 

ID CKR5_TRAPH STANDARD; PRT; 352 AA. 

AC O97879; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 
GN CCR5 OR CMKBR5. 

OS Trachypithecus phayrei (Phayre's leaf monkey). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; Colobinae; 

OC Trachypithecus. 

OX NCBI_TaxID=61618; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99416438; PubMed=10486970 ; 


RA Zhang Y.-W., Ryder O.A., Zhang Y.-P.; 
RT "Sequence evolution of the CCR5 chemokine receptor gene in primates."; 
RL Mol. Biol. Evol. 16:1145-1154(1999). 
CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 
CC MIP-1-beta and RANTES and subsequently transduces a signal by 
CC increasing the intracellular calcium ions level. May play a role 
CC in the control of granulocytic lineage proliferation or 
CC differentiation. 
CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 
CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 
CC 
CC This SWISS-PROT entry is copyright. It is produced through a collaboration 
CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 
CC the European Bioinformatics Institute. There are no restrictions on its 
CC use by non-profit institutions as long as its content is in no way 
CC modified and this statement is not removed. Usage by and for commercial 
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
CC or send an email to license@isb-sib.ch). 
CC 
DR EMBL; AF075443; AAD19855.1; -. 
DR InterPro; IPR000276; GPCR_Rhodpsn. 
DR Pfam; PF00001; 7tm_1; 1. 
DR PRINTS; PR00237; GPCRRHODOPSN. 
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1. 
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1. 
KW G-protein coupled receptor; Transmembrane; Glycoprotein; Sulfation. 

FT DOMAIN 1 30 EXTRACELLULAR (POTENTIAL). 
FT TRANSMEM 31 58 1 (POTENTIAL). 
FT DOMAIN 59 68 CYTOPLASMIC (POTENTIAL). 
FT TRANSMEM 69 89 2 (POTENTIAL). 
FT DOMAIN 90 102 EXTRACELLULAR (POTENTIAL). 
FT TRANSMEM 103 124 3 (POTENTIAL). 
FT DOMAIN 125 141 CYTOPLASMIC (POTENTIAL). 
FT TRANSMEM 142 166 4 (POTENTIAL). 
FT DOMAIN 167 198 EXTRACELLULAR (POTENTIAL). 
FT TRANSMEM 199 218 5 (POTENTIAL). 
FT DOMAIN 219 235 CYTOPLASMIC (POTENTIAL). 
FT TRANSMEM 236 260 6 (POTENTIAL). 
FT DOMAIN 261 277 EXTRACELLULAR (POTENTIAL). 
FT TRANSMEM 278 301 7 (POTENTIAL). 
FT DOMAIN 302 352 CYTOPLASMIC (POTENTIAL). 
FT DISULFID 101 178 BY SIMILARITY. 
FT MOD_RES 3 3 SULFATION (BY SIMILARITY). 
FT MOD_RES 10 10 SULFATION (BY SIMILARITY). 
FT MOD_RES 14 14 SULFATION (BY SIMILARITY). 
FT MOD_RES 15 15 SULFATION (BY SIMILARITY). 
SQ SEQUENCE 352 AA; 40509 MW; 4366F148D3A5938F CRC64; 


Query Match 45.8%; Score 365; DB 1; Length 352; 

Best Local Similarity 33.6%; Pred. No. 4.1e-28; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I : I I 1 I I I I I I : I! I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKVNVKQIAARLLPPLYSLVFIFGFVGNILVVLILINCKR 60 


Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I : I I I I I I I 

Db 119 IILLTIDRYIAIVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQREGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I II I I I I I I I I II I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 10 
CKR5_GORGO 

ID CKR5_GORGO STANDARD; PRT; 352 AA. 

AC P56439; 

DT 15-JUL-1998 (Rel. 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5). 

GN CCR5 OR CMKBR5. 

OS Gorilla gorilla gorilla (Lowland gorilla).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Gorilla.

OX NCBI_TaxID=9595;

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=97268687; PubMed=9108095;

RA Edinger A.L., Amedee A., Miller K., Doranz B.J., Endres M.,

RA Sharron M., Samson M., Lu Z.H., Clements J.E., Murphey-Corb M.,

RA Peiper S.C., Parmentier M., Broder C.C., Doms R.W.;

RT "Differential utilization of CCR5 by macrophage and T cell tropic 

RT simian immunodeficiency virus strains."; 

RL Proc. Natl. Acad. Sci. U.S.A. 94:4005-4010(1997). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF005659; AAB62553.1; 

DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW G-protein coupled receptor; Transmembrane; Glycoprotein

FT DOMAIN      1   30  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM   31   58  1 (POTENTIAL).
FT DOMAIN     59   68  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM   69   89  2 (POTENTIAL).
FT DOMAIN     90  102  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM  103  124  3 (POTENTIAL).
FT DOMAIN    125  141  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  142  166  4 (POTENTIAL).
FT DOMAIN    167  198  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM  199  218  5 (POTENTIAL).
FT DOMAIN    219  235  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  236  260  6 (POTENTIAL).
FT DOMAIN    261  277  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM  278  301  7 (POTENTIAL).
FT DOMAIN    302  352  CYTOPLASMIC (POTENTIAL).
FT DISULFID  101  178  BY SIMILARITY.
FT MOD_RES     3    3  SULFATION (BY SIMILARITY).
FT MOD_RES    10   10  SULFATION (BY SIMILARITY).
FT MOD_RES    14   14  SULFATION (BY SIMILARITY).
FT MOD_RES    15   15  SULFATION (BY SIMILARITY).
SQ SEQUENCE 352 AA; 40515 MW; D0E6FCB9FE5EAC84 CRC64;


Query Match 45.7%; Score 364; DB 1; Length 352; 

Best Local Similarity 33.9%; Pred. No. 5.1e-28; 

Matches 96; Conservative 8; Mismatches 29; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I 111:111111111 I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKTNVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHS YKCG 64 

II I I I I I I I I I I I I I : I :: I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78

: I II I I I I I I I I I 

Db 119 IILLTIDRYLAIVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 11 
CKR5_MACMU 

ID CKR5_MACMU STANDARD; PRT; 352 AA. 

AC P79436; O02746;

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 


DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5).

GN CCR5 OR CMKBR5.

OS Macaca mulatta (Rhesus macaque),

OS Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey), and

OS Macaca nemestrina (Pig-tailed macaque).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae;

OC Cercopithecinae; Macaca. 

OX NCBI_TaxID=9544, 9541, 9545; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC SPECIES=M. mulatta; 

RX MEDLINE=97184592; PubMed=9032394;

RA Marcon L., Choe H., Martin K.A., Farzan M., Ponath P.D., Wu L.,

RA Newman W., Gerard N., Gerard C., Sodroski J.;

RT "Utilization of C-C chemokine receptor 5 by the envelope 

RT glycoproteins of a pathogenic simian immunodeficiency virus, 

RT SIVmac239."; 

RL J. Virol. 71:2522-2527(1997). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC SPECIES=M. mulatta; STRAIN=Indian macaque; 

RX MEDLINE=97213934; PubMed=9060623;

RA Chen Z., Zhou P., Ho D.D., Landau N.R., Marx P.A.;

RT "Genetically divergent strains of simian immunodeficiency virus use 

RT CCR5 as a coreceptor for entry."; 

RL J. Virol. 71:2705-2714(1997). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC SPECIES=M. mulatta; 

RX MEDLINE=21354176; PubMed=11461684;

RA Margulies B.J., Hauer D.A., Clements J.E.;

RT "Identification and comparison of eleven rhesus macaque chemokine

RT receptors.";

RL AIDS Res. Hum. Retroviruses 17:981-986(2001).

RN [4] 

RP SEQUENCE FROM N.A. 

RC SPECIES=M. mulatta, M. fascicularis, and M. nemestrina;

RX MEDLINE=97268687; PubMed=9108095;

RA Edinger A.L., Amedee A., Miller K., Doranz B.J., Endres M.,

RA Sharron M., Samson M., Lu Z.H., Clements J.E., Murphey-Corb M.,

RA Peiper S.C., Parmentier M., Broder C.C., Doms R.W.;

RT "Differential utilization of CCR5 by macrophage and T cell tropic

RT simian immunodeficiency virus strains.";

RL Proc. Natl. Acad. Sci. U.S.A. 94:4005-4010(1997).

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 
CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors.

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its


CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; U77672; AAC51109.1; -.
DR EMBL; U73739; AAC51158.1;
DR EMBL; U96762; AAC34132.1; -.
DR EMBL; AF005660; AAB62554.1;
DR EMBL; AF005661; AAB62555.1;
DR EMBL; AF005662; AAB62556.1;
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW G-protein coupled receptor; Transmembrane; Glycoprotein
FT DOMAIN      1   30  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM   31   58  1 (POTENTIAL).
FT DOMAIN     59   68  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM   69   89  2 (POTENTIAL).
FT DOMAIN     90  102  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM  103  124  3 (POTENTIAL).
FT DOMAIN    125  141  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  142  166  4 (POTENTIAL).
FT DOMAIN    167  198  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM  199  218  5 (POTENTIAL).
FT DOMAIN    219  235  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  236  260  6 (POTENTIAL).
FT DOMAIN    261  277  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM  278  301  7 (POTENTIAL).
FT DOMAIN    302  352  CYTOPLASMIC (POTENTIAL).
FT DISULFID  101  178  BY SIMILARITY.
FT MOD_RES     3    3  SULFATION (BY SIMILARITY).
FT MOD_RES    10   10  SULFATION (BY SIMILARITY).
FT MOD_RES    14   14  SULFATION (BY SIMILARITY).
FT MOD_RES    15   15  SULFATION (BY SIMILARITY).
FT CARBOHYD  268  268  N-LINKED (GLCNAC...) (PO
FT CONFLICT  241  241  M -> I (IN REF. 3).
FT CONFLICT  292  292  I -> M (IN REF. 3).
SQ SEQUENCE 352 AA; 40507 MW; 58B96C85909FACB2 CRC64;

Query Match 45.5%; Score 363; DB 1; Length 352;
Best Local Similarity 33.6%; Pred. No. 6.4e-28;
Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 7;

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30

I I I I I I I I II I : I II I I I I I I I I I I I I I I
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64

I I I I I I I I I I I I II I : I : : I

Db 61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78

: I I I I : I II I I I I

Db 119 IILLTIDRYLAIVHAVFALKARTVTFGWTSVITWWAVFASLPGIIFTRSQREGLHYTC 178


Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I II II I I I I I I II I I I I : I 
Db 179 SSHFPYSQYQFWKNFQTLKMVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 12 
CKR5_HYLML 

ID CKR5_HYLML STANDARD; PRT; 352 AA. 

AC Q95NC0; 

DT 15-MAR-2004 (Rel. 43, Created) 

DT 15-MAR-2004 (Rel. 43, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5).

GN CCR5 OR CMKBR5.

OS Hylobates moloch (Silvery gibbon).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hylobatidae; Hylobates. 

OX NCBI_TaxID=81572; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99416438; PubMed=10486970;

RA Zhang Y.-W., Ryder O.A., Zhang Y.-P.;

RT "Sequence evolution of the CCR5 chemokine receptor gene in primates."; 

RL Mol. Biol. Evol. 16:1145-1154(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 
CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF177899; AAK43382.1; 

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Transmembrane; Glycoprotein; Sulfation. 


FT DOMAIN      1   30  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM   31   58  1 (POTENTIAL).
FT DOMAIN     59   68  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM   69   89  2 (POTENTIAL).
FT DOMAIN     90  102  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM  103  124  3 (POTENTIAL).
FT DOMAIN    125  141  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  142  166  4 (POTENTIAL).
FT DOMAIN    167  198  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM  199  218  5 (POTENTIAL).
FT DOMAIN    219  235  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  236  260  6 (POTENTIAL).
FT DOMAIN    261  277  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM  278  301  7 (POTENTIAL).
FT DOMAIN    302  352  CYTOPLASMIC (POTENTIAL).
FT DISULFID  101  178  BY SIMILARITY.
FT MOD_RES     3    3  SULFATION (BY SIMILARITY).
FT MOD_RES    10   10  SULFATION (BY SIMILARITY).
FT MOD_RES    14   14  SULFATION (BY SIMILARITY).
SQ SEQUENCE 352 AA; 40436 MW; 9623CA98340CF274 CRC64;


Query Match 45.4%; Score 362; DB 1; Length 352; 

Best Local Similarity 33.9%; Pred. No. 8e-28; 

Matches 96; Conservative 8; Mismatches 29; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30

I I I I I I I I I I I : I I I I I I I I I I I I I I I I 

Db 1 MDYQVSSPTYDIDYYTSGPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAIVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 13 
CKR5_CERPY 

ID CKR5_CERPY STANDARD; PRT; 352 AA. 

AC Q9TV42; 

DT 15-MAR-2004 (Rel. 43, Created) 

DT 15-MAR-2004 (Rel. 43, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5).
GN CCR5 OR CMKBR5.

OS Cercopithecus pygerythrus (Vervet monkey).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae;
OC Cercopithecinae; Cercopithecus. 
OX NCBI_TaxID=60710; 
RN [1] 

RP SEQUENCE FROM N.A. 


RC STRAIN=100; 

RX MEDLINE=99335215; PubMed=10408730;

RA Muller-Trutwin M.C., Corbet S., Hansen J., Georges-Courbot M.-C.,

RA Diop O., Rigoulet J., Barre-Sinoussi F., Fomsgaard A.;

RT "Mutations in CCR5-coding sequences are not associated with SIV 

RT carrier status in African nonhuman primates."; 

RL AIDS Res. Hum. Retroviruses 15:931-939(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 
CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF035222; AAD44015.1; 

DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW G-protein coupled receptor; Transmembrane; Glycoprotein

FT DOMAIN      1   30  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM   31   58  1 (POTENTIAL).
FT DOMAIN     59   68  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM   69   89  2 (POTENTIAL).
FT DOMAIN     90  102  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM  103  124  3 (POTENTIAL).
FT DOMAIN    125  141  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  142  166  4 (POTENTIAL).
FT DOMAIN    167  198  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM  199  218  5 (POTENTIAL).
FT DOMAIN    219  235  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  236  260  6 (POTENTIAL).
FT DOMAIN    261  277  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM  278  301  7 (POTENTIAL).
FT DOMAIN    302  352  CYTOPLASMIC (POTENTIAL).
FT DISULFID  101  178  BY SIMILARITY.
FT MOD_RES     3    3  SULFATION (BY SIMILARITY).
FT MOD_RES    10   10  SULFATION (BY SIMILARITY).
FT MOD_RES    15   15  SULFATION (BY SIMILARITY).
SQ SEQUENCE 352 AA; 40558 MW; EF17D67CBCCC3DB0 CRC64;


Query Match 45.3%; Score 361; DB 1; Length 352; 

Best Local Similarity 33.6%; Pred. No. 9.9e-28; 

Matches 95; Conservative 9; Mismatches 29; Indels 150; Gaps 7; 


Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30

I I I I I I I I II I : I I I I II I I I I I I I I I I I

Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60


Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I t I II I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNIAISDLLFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78

: I I I I : I I I II I 

Db 119 IILLTIDRYIAIVHAVFALKARTWFGVVTSVITWWAVFASLPRIIFTRSQREGLHYAC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I II I I I II I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 14 
CKR5_CERTO 

ID CKR5_CERTO STANDARD; PRT; 352 AA. 

AC O62743; O62744; O62745; O62746;

DT 15-DEC-1998 (Rel. 37, Created) 

DT 15-DEC-1998 (Rel. 37, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5).

GN CCR5 OR CMKBR5.

OS Cercocebus torquatus atys (Red-crowned mangabey) (Sooty mangabey).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae;

OC Cercopithecinae; Cercocebus. 

OX NCBI_TaxID=9531; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Isolate 079, 085, 087, and 089; 

RX MEDLINE=98321155; PubMed=9656999;

RA Chen Z., Gettie A., Ho D.D., Marx P.A.;

RT "Primary SIVsm isolates use the CCR5 coreceptor from sooty mangabeys

RT naturally infected in west Africa: a comparison of coreceptor usage

RT of primary SIVsm, HIV-2, and SIVmac";

RL Virology 246:113-124(1998).

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors.

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).


CC
DR EMBL; AF051902; AAC39830.1;
DR EMBL; AF051903; AAC39831.1;
DR EMBL; AF051904; AAC39832.1;
DR EMBL; AF051905; AAC39833.1;
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW G-protein coupled receptor; Transmembrane; Glycoprotein; Sulfa

FT DOMAIN      1   30  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM   31   58  1 (POTENTIAL).
FT DOMAIN     59   68  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM   69   89  2 (POTENTIAL).
FT DOMAIN     90  102  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM  103  124  3 (POTENTIAL).
FT DOMAIN    125  141  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  142  166  4 (POTENTIAL).
FT DOMAIN    167  198  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM  199  218  5 (POTENTIAL).
FT DOMAIN    219  235  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  236  260  6 (POTENTIAL).
FT DOMAIN    261  277  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM  278  301  7 (POTENTIAL).
FT DOMAIN    302  352  CYTOPLASMIC (POTENTIAL).
FT DISULFID  101  178  BY SIMILARITY.
FT MOD_RES     3    3  SULFATION (BY SIMILARITY).
FT MOD_RES    10   10  SULFATION (BY SIMILARITY).
FT MOD_RES    14   14  SULFATION (BY SIMILARITY).
FT MOD_RES    15   15  SULFATION (BY SIMILARITY).
FT VARIANT     2    2  D -> E (IN ISOLATE 087).
FT VARIANT     3    3  Y -> D (IN ISOLATE 079).
FT VARIANT    25   25  V -> G (IN ISOLATE 087).
FT VARIANT   100  100  M -> K (IN ISOLATE 079).
FT VARIANT   107  107  L -> V (IN ISOLATE 089).
FT VARIANT   134  134  V -> G (IN ISOLATE 079).
FT VARIANT   146  146  V -> L (IN ISOLATES 085 AND 089).
FT VARIANT   340  340  T -> I (IN ISOLATE 079).
SQ SEQUENCE 352 AA; 40489 MW; 20A196E2D47E49CA CRC64;


Query Match 45.3%; Score 361; DB 1; Length 352;

Best Local Similarity 33.6%; Pred. No. 9.9e-28;

Matches 95; Conservative 9; Mismatches 29; Indels 150; Gaps 7;


Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30

I I I I II I I I I I : I I I II I I I I I I I I I I I I

Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64

I I I I I I I I I I I I I I I : I : : I

Db 61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78

: I I I I : I I I I II I

Db 119 IILLTIDRYLAIVHAVFALKARTVTFGWTSVITWWAVFASLPGIIFTRSQREGLHYTC 178


Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105

I I I I I I I I I I I I I I I II I I : : I

Db 179 SPHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135

I I I I I I I I I I I I I I I I I I : I

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281

RESULT 15
CKR5_HYLLE

ID CKR5_HYLLE STANDARD; PRT; 352 AA.
AC O97883;
DT 30-MAY-2000 (Rel. 39, Created)
DT 30-MAY-2000 (Rel. 39, Last sequence update)
DT 28-FEB-2003 (Rel. 41, Last annotation update)
DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5).
GN CCR5 OR CMKBR5.
OS Hylobates leucogenys (White-cheeked gibbon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hylobatidae; Hylobates.
OX NCBI_TaxID=61853;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=99416438; PubMed=10486970;
RA Zhang Y.-W., Ryder O.A., Zhang Y.-P.;
RT "Sequence evolution of the CCR5 chemokine receptor gene in primates.";
RL Mol. Biol. Evol. 16:1145-1154(1999).
CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha,
CC MIP-1-beta and RANTES and subsequently transduces a signal by
CC increasing the intracellular calcium ions level. May play a role
CC in the control of granulocytic lineage proliferation or
CC differentiation.
CC -!- SUBCELLULAR LOCATION: Integral membrane protein.
CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AF075451; AAD19863.1;
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW G-protein coupled receptor; Transmembrane; Glycoprotei
FT DOMAIN      1   30  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM   31   58  1 (POTENTIAL).
FT DOMAIN     59   68  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM   69   89  2 (POTENTIAL).
FT DOMAIN     90  102  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM  103  124  3 (POTENTIAL).


FT DOMAIN    125  141  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  142  166  4 (POTENTIAL).
FT DOMAIN    167  198  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM  199  218  5 (POTENTIAL).
FT DOMAIN    219  235  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  236  260  6 (POTENTIAL).
FT DOMAIN    261  277  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM  278  301  7 (POTENTIAL).
FT DOMAIN    302  352  CYTOPLASMIC (POTENTIAL).
FT DISULFID  101  178  BY SIMILARITY.
FT MOD_RES     3    3  SULFATION (BY SIMILARITY).
FT MOD_RES    10   10  SULFATION (BY SIMILARITY).
FT MOD_RES    14   14  SULFATION (BY SIMILARITY).
SQ SEQUENCE 352 AA; 40445 MW; 4F8E4F344CEB7C91 CRC64;


Query Match 45.0%; Score 359; DB 1; Length 352; 

Best Local Similarity 33.9%; Pred. No. 1.6e-27; 

Matches 96; Conservative 8; Mismatches 29; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I II I I I I I I I : I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPTYDIDYDTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILVLINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118

Qy 65 LCTRSQKEGLHYTC 78

: I II I I I I I I I I I 

Db 119 IILLTIDRYLAIVHAVFALKARTVTFGVVTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVTCYSGILKTLLRCRNEKKRHRAVRLIF 238

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I II I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


Search completed: September 17, 2004, 16:45:42 
Job time : 15.0339 secs


